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vi PREFACE 

an extensive battery of tests. The battery arranged therefore included tests 
of the simple speech processes; of more complicated processes requiring 
language facility and “verbal intelligence"; of attainment in reading, writ- 
ing, spelling, and arithmetic; and of performance on non-verbal tests de- 
manding intelligent behavior. On the basis of these tests, studies were 
made of 234 aphasic and control cases. The results for the aphasic and the 
apraxic or agnosic cases, as well as for the control cases of unilateral cere- 
bral lesion without these disorders, have been reported in a separate vol- 
ume." In every case the abnormal performances were interpreted in the light 
of the behavior of the normal subjects studied by the same methods. 

The work with the normal adults was begun in 1929 by Mildred Willard 
Gardiner, MA, who examined approximately half of the subjects. The re- 
mainder were examined by Anne Roe, Ph.D. 

In order to make satisfactory comparisons between normal and abnor- 
mal individuals, the groups had to be as nearly as possible equivalent i a age 
and in educational, occupational, and social status. The majority of the 
aphasic and other cases with cerebral lesion were studied in three Philadel- 
phia hospitals: the Orthopedic Hospital and Infirmary for Nervous Dis- 
eases, the Graduate Hospital of the University of Pennsylvania, and the 
Philadelphia General Hospital. However paradoxical it seemed, it was 
clear that the stxalled normal adults would have to be selected from among 
the patients admitted to the same hospitals, for only thus could a random 
selection from similar levels of the population be obtained. As will be de- 
scribed in Chapter II, the normal cases were chosen from the surgical or 
orthopedic wards and were accepted for study only if the record was free 
of neurological or mental disorders and the physical condition satisfactory 
for good work. 

For permission to study the patients selected for the normal group the 
authors are indebted to Dr. William J. Taylor, the late Dr. A. P. C. Ash- 
hurst, Dr. A. Bruce Gill, and Dr. Ralph Pemberton at the Orthopedic Hos* 
pital and Infirmary for Nervous Diseases; the late Dr. J. B. Carnett, P r - 
W. G. Elmer, Dr. W. E. Lee, and Dr. H- L. Bockus at the Graduate Hos- 
pital; and to Dr. E. L. Eliason, Dr. H. R. Owen, Dr. M. Behrcnd, P r - 
J. O. Bower, Dr. J. T. Rugh, and Dr. H- M. Righter at the Philadelphia 
General Hospital. Dr. W. G. Turnbull at the Philadelphia General Hos- 
pital very kindly made the necessary arrangements there. The authors ate 

*T. Webeaburg and K. E. McBride, Aphasia- A Clinical and Psychological Study. 
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changes in mental functioning. From the point of view of work with the 
normal, much of the material is particularly timely for it can be applied in 
the adult education projects which are now assuming such a prominent 
place in American life. In many cases adults, like children entering a 
school, must be given preliminary tests to provide for a satisfactory group- 
ing of individuals of different levels of development. In all cases prelimi- 
nary examination is required to determine their comprehension in read- 
ing, their language facility, and various other factors on which their suc- 
cess in the work will depend. From the point of view of studies of the 
abnormal, it must be noted pardcularly that standards of intellectual per- 
formances for the "average man” are still little known and that they have 
not before been determined for most of the individual tests included in 
this battery. Yet some of these are tests of primary importance in work 
with the abnormal, where the purpose is not only the determination of the 
intellectual level, but the qualitative analysis of the patient’s performance, 
his methods of work, and his attitudes. 

The monograph also contains analyses of the results for their bearing on 
a number of important problems: the comparative findings for the groups 
of men and women, the relationship of age and test performance, the fac- 
tor of education, the differences in test performance in certain occupa- 
tional classes, the relative standing of the individual on a wide variety of 
tests, and, finally, the relationship between the different test performances 
for the group as a whole. The findings on all these questions receive gen- 
eral discussion in Chapter IX, and the reader who is interested in the 
major contributions of the study rather than the detailed results is re- 
ferred especially to this chapter. 

It had been Dr. Weisenburg’s plan to write the preface for this mono- 
graph, and in it to stress the practical significance of the results for the 
psychiatrist, the educator, and the psychologist. With his death, before the 
final draft of the book was completed, the preface had not yet been writ- 
ten. I have included the points he had intended to cover, in so far as I 
know them, but I deeply regret that the work has not had the benefit of 
his final criticisms, for one of the qualities which made him great as a 
director of research was his unusual ability to synthesize the results of a 
scientific investigation and foresee both their theoretical and their practical 
import. 

K. E. McB. 


April J9JS 
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A CRITICAL SURVEY OF STUDIES RELATING TO 
ADULT INTELLIGENCE 

1. Introduction 

2. Development of Mental Testing in College Groups 

3. Intelligence Tests in the Army 

4. Intelligence Tests in Industry 

5. Sex Differences in Intelligence Test Performances 

6. The Growth and Decline of Intelligence 


I. INTRODUCTION 


W ITHIN the last few years the problems of adult intelligence have 
for the first time in the histoiy of "mental testing” been receiving 
the study their importance merits. It is true that they came into great promi- 
nence during the War when Yerkes and others undertook the colossal task 
of arranging intelligence tests for all grades of adults from the dullest to the 
most gifted, and the further task of having these tests quickly and well ad- 
ministered and interpreted. 1 But the primary purpose of the Army survey 
was a rough determination of the grade of intelligence for each of the 
nearly two million men tested, and there was no time to give to finer points. 
Thus while the Army data demonstrate the general mental level of the 
average man to the comparative satisfaction of the psychologist, if not of 
the layman, they do not provide material for study of the further problems 
which are coming to occupy the center of the field. The question of the re- 
lationship of different mental abilities, which attracts so much research at 
the present time, has been studied on the basis of the Army data, notably 
by Spearman, Kelley, and Brigham; but most investigators agree with 
Kelley that the material is not really suitable for such analysis. 2 The ques- 
tion as to the course of different test performances with age cannot be solved 
on the basis of the Army findings, for at different ages there were probably 
differing selective factors which made the various age groups incomparable. 
The recent studies of adult intelligence have yielded important data on 


* R. M. Yerkes, Psychological Examining in the Veiled States Army, Mem Nit. Acad. 
$«L, 1911, 15. 

*T. L. Kelley, Crorsroadt in the Mind of Sian, 193S. 215 
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L relationship between age and various test performances, tte two out- 
rending contributions are the large survey of a New Engbmd community 
made by Jones and Conrad’ by means of the Army Alpha Test and the 
Maturity Study made by Miles* at Stanford. The first is unique among 
studies of adults because it is a survey of an almost complete community. 
For the riven racial and cultural group it yields decisive information on the 
tvoical adult performance and the range of performances Furthermore, m 
as the right sub-tests of the Army Alpha permit, the study furnish* 
on the eharaeterisucs of different types of test performance. The lengd. 
‘ , . examination had of course to be limited to make possible the testing 
of lame numbers at one sitting and to ensure their cooperation. 

Th e S Maturity Study at Stanford is also based on large numbers of adults, 
hot they arc probably less representative of the populauon at large than 
b “ ' irnined by Jones. In the firs, testing program in 1930 approximately 
e h Took die brief ntelligence test, a fifteen-minute form of the Ons Self- 
AdministeringTest, andri addition certain brief tests of other fiincnons, 
perception, motility, memory, and imagmauon. Otto large 
s mJs were given various tests in the following years. The test- 
F r as a whole wL more extensive than that of any other mvesuga- 
! " S '’US be noted however, first, that the attempt .0 cover a large num- 
£ rf different types of performaneem 0 ° f 

S’KSSSi must ^interpreted sepanttely, for not all of the groups 


-.rtMTAI TESTING IN COLLEGE CROUPS 
2. DEVELOPMENT OF MENTAL 

u hrine out individual differences in mental 

Test situanons whteh would tag" jciencc Dnc recalI , Gidran - S 
traits arc by no means inventions a - ^ attack on the Midianites, 

method of reducing his force of 10,000 

„ . Tkr Growth atti Dedine of lntdhscr.ee, Cenet. PayehoJ. 

• H. E. Jones and H. S. Conrad, 

vionoe . 1033. 13. 2*3-*9 8 - ^ . 0 f intelligence Scorer and Chronological dge 

M Tc 2, aand W. R- US'* The ***£*£ £ 4t jS; W. R. Mik* Hu- 

, r-L to late Maturity, Amer- J- ‘ 7~ 
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STUDIES OF ADULT INTELLIGENCE 
latcd with themselves to a considerable degree, did not correlate with the 
mental tests. The almost complete lack of correlation was a serious blow to 
the laboratory mental tests, for it seemed obvious that the grades were in 
large part dependent upon intelligence. 

Need for the development of more satisfactory tests then became even 
more evident, and gradually, with increasing work on the more complex 
processes, such tests began to appear. In 1910 Goddard revised the Bine: 
Scale for use in the United States. In 1913 Calfce, Waugh, Bingham, Bell, 
Kitson, and others reported the findings for college students on complex 
tests such as directions, sentence completions, mixed relations, and logical 
memory. All this work has been well reviewed by Carothers as a prelimi- 
nary to a report of her own study of 200 Barnard freshmen in rprj and 
1916. 13 She gave nineteen tests which differed considerably in their nature 
and complexity from those Wisdcrbad used fifteen years before. Carothers' 
tests fell, according to their intercorrelations, into five groups: 


I. Coordination and Tapping 

II. Cancellation, Checking, Color Naming, Word Naming, and Substitution 

III. Directions, Opposites, Verb-Object, Mixed Relations, Word Building, and 
Completion 

IV. Word Recollection, Word Recognition, Logical Recollection, and Logical 
Recognition 

V. Information and Vocabulary 

Miscellaneous: Digit Span and Knox Cube 


It is interesting to note that the author looked for a general common fac- 
tor but found no evidence of one. Her correlations with a composite scar 
of all the groups of academic subjects were higher than Wisslcr s, a natural 
result of the types of tests used. They ranged from to "h- 2 /’ 

In 1918, largely as a result of the successful use of the Thorndike Test 
Mental Alertness with the Student Army Training Corps, the Columbia 
faculty voted an alternative method of admission , to include an intelligence 
test. 17 From that time on intelligence tests became more and more impor 
tant as supplementary criteria, and sometimes constituted the chief criterion 


** F. E. Caratbas. Psychological EiaminaZoas of CoBcge SttJem. An*- PsyvicL «9*-» 
6, No. 46. _ 

51 She called tin Composite score "Intdligence quotient.** _ _ , j„. 

Wood made an enthusiastic report on tie results of ietetligaws 0 sdogat 
lag the next fire jears. B. D. Wood. Sfeosurenent in Higher EJutaXon. 
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for college admission. The Army Alpha and the Thorndike College En- 
trance Examination were the most widely used, particularly in the earlier 
years. At present some colleges have their own tests, modeled largely 
upon these two. The colleges which are members of the College Entrance 
Examination Board require for admission not only subject achievement 
tests but also the Scholastic Aptitude Test, the first form of which was con- 
structed for the Board in 1926 by Brigham, Angier, McPhaill, Rogers, and 
Stone. This test, which at first had nine sub-tests, has been subjected to 
more extensive and elaborate statistical analysis than any other ever intro- 
duced, and has been continuously modified. At the present time it consists 
of two sections, verbal and mathematical. The verbal section contains three 
sub-tests: 1) Antonyms, four words, among which a pair of opposites is to 
be found; 2) Double definitions, a modified completion test; and 3) Para- 
graph reading. The mathematical section consists of 100 items. 18 The large 
population (8,000 to 9,000 a year) to which these tests have been given is, of 
course, a highly selected one, and the results show only the intellectual char- 
acteristics of the young adult of superior mental ability. 

3. INTELLIGENCE TESTS IN THE ARMY 

The first and largest study of adult intelligence ever undertaken was 
made in the United States Army during the World War. It was a radical 
innovation in military procedure. In the years just before the War there had 
been suggestions in both France and Germany that mental test methods 
would aid in the selection of men fit for military service. The French au- 
thorities, however, saw no merit in the scheme which Bmec and Simon 
proposed in 1910, 19 and apparently no extensive use was made in Germany 
of any of the procedures suggested for determining the intellectual status 
of conscripts and eliminating the low grade. 20 

When the United States entered the War on April 6, 1917, a group of 
psychologists then meeting at Harvard appointed a committee to gather in- 
formation on the part psychology could play in the preparations for war. In 

18 He tests are not arailab'e for reference, but yearly reports are published; the reports 
through 1931 are reprinted in C. C. Brigham's book, A Study of Error. 

18 A, Binet and Th Simon, Stir la neeettui d" une methoie applicable an diagnostic Jet ar- 
rihfet mtlttairet, Anaales mMico-psycboI., 1910, 11, 411-465. 

20 Anonymous, The Experience of the German Army with the Defective end Feeble- 
minded, J. Psycho-Aslhemcs, 1911, 16, 68-76. 
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the early summer a group mental examination was arranged by Yerkes, 
Bingham, Goddard, Haines, Terman, Wells, and Whipple, and tried out 
with such dispatch that by August first a report embodying the plan for ex- 
tensive testing and presenting the results of the trial examinations was in 
the hands of the Surgeon General. A detailed review of the problems in- 
volved has been presented by Yerkes in the fifteenth volume of the Mem- 
oirs of the National Academy of Sciences. 

Between September 1917 and January 1919, 1,726,966 men were exam- 
ined. 21 Most of them were given the Army Alpha, some the Beta, and some 
were also given a series of individual tests— the Stanford Binet and a long 
scries of performance tests. The tremendous number of records obtained on 
the Alpha Test makes it of greatest interest for any study of adult intelli- 
gence. The test in its final form was composed of the following sub-tests, 
the origins of which are also given below r 2 


Oral directions: adapted from A be] son, Otis, Woodworth, and Wells. 

3. Arithmetical reasoning. 

3. Practical judgment: adapted from Binet “comprehension questions” and 
Bonser “selective judgment” tests. 

4. Synonym-antonym: Otis test. 

5. Disarranged sentences: Otis adaptation of the Binet test. 

6 . Number series completion: adapted from Rogers* missing number test. 

7. Analogies: Otis, Bingham, and Thurstone form of test. 

8. Information: suggested by Wells, Bingham, and Whipple. 

The distribution of intelligence in the United States, as indicated by the 
Army results, showed a marked positive skew, a fact which has led to con 
troversy as to whether the true distribution of intelligence, which is appar- 
ently normal for children, was actually skewed in adult groups, or wheth^ 
the results depended partly on factors in the sampling. Terman contend 
that the Army draft was not representative because, in his opinion, a large 


n Four groups were chosen tor statistical analysis- 
and a selected group in permanent organizations- These grotips 


i- white dnfi, Negro draft, white oScers, 


35,391 for Negro retn“J*- 


e added those of a special e 
grand total to 162,536. 

11 Mem- Nat. Acad. ScL, 1911, »5. 3 00-302- ««! these 

The original test (Army test a) consisted of ten sub-tests. After a preliminary^ ,, 
were cut to eight sub-tests with a total of 21a items, and the tests retained were a . 
vised to correct faults found in the original series- 
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proportion of those granted exemptions were men of superior intelligence. 23 
Lincoln disagreed and presented arguments to show that the exempted 
groups were not superior. 21 There are undoubtedly points on either side, 
but it seems probable that Lincoln’s position is the more justifiable and that 
the sample is fairly representative and on the whole the most satisfactory 
ever obtained. 

The median Army Alpha score for the native-born white men was 58.9, 
while that for the foreign-born white was 46.7, and that for the Negro con- 
siderably lower (38.6 for Negroes from northern states and 12.4 for Negroes 
from southern). On the basis of Stanford Binet Tests of 653 men, Army 
Alpha scores were transmuted into Stanford Binet equivalents. The cen- 
tral result of this analysis, namely, that the intelligence of the white draft 
was not much above a thirteen-year level, is too well known to necessitate 
restatement. It has not only been discussed pro and con by psychologists, 
but has been seized upon by popular writers who have made good copy of 
the intellectual level of the average man. The height of the popular indig- 
nation was reached in Walter Lippmann’s papers in the New Republic in 
1922 and 1923. 29 They expressed many criticisms of the mental testing 
movement which psychologists in general would reject, but they sounded a 
note of caution which psychologists as well as laymen were finding ex- 
tremely necessary, namely that the mental-age method of expressing intelli- 
gence test results was likely to be misleading when applied to the scores of 
adult subjects. 

Tetman contended that the group of 653 men given the Stanford Binet 
Scale was not representative of the entire draft, while Lincoln found that it 
had a distinctly higher percentage of A and B men (as determined by 
Alpha scores) . The question is of little importance in relation to that of the 
character of the draft as a whole, for there is nothing to be gained by trans- 
muting Alpha into Stanford Binet Scores, but it seems clear that the groups 
on which the Army standards were established were more nearly repre- 
sentative of the population at large than those on which the Stanford Binet 
norms were based. 

This point was clearly made by Symonds in an article in which he re- 

21 L. M Term an. Mental Growth and the I. Q , J Educ. Pjychot, 1911, 12, 325-341. 

>* £. A. Lincoln, TAe Mental Age of Adttltt, J. Edue. Res . ig«, 6, 133-144 

*» W. Lippmann, The Mental Age of Amertcant, New Republic, 192s, 32, 213-215, 3 ,g_ 
* 4 ®. 275-277. 297-298* 328-530; 1923. 33 . 9 -io 
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ported a second approximation to die curve of the distribution of intelli- 
gence of the general population. 28 Assuming the normality of the distribu- 
tion in each occupational group, be weighted the distributions for different 
groups in the Army according to the numbers allocated to each by the 1910 
census. He found that the total curve thus obtained was strongly skewed to 
the right; and that “the numbers in low intelligence occupations . . . out- 
weigh the numbers in high intelligence occupations." His final Alpha me- 
dian was 48, and sigma 43. These figures corresponded almost exaedy to the 
figures obtained for the Army data when medians for native-white, foreign- 
born, and Negro groups were weighted and combined. Symonds’ com- 
ments are particularly arresting: 

Where shall we go for the average man? The average man has an Army 
Alpha score of about 48. Fryer's table [see page 48] shows that representative 
occupations around this level are masons, hospital attendants, station agents, 
miners, teamsters, riggers, boilermakers, airplane workers, factory storekeepers, 
horse shoers, salesclerks, hosders, barbers, stationary engineers, cobblers, horse 
trainers, caterers, bricklayers, auto truck chauffeurs, farmers, concrete workers, 
printers, and bakers. To obtain representative figures children and adults should 
be tested coming from social groups of which the above listed occupations are 
typical, not high school students and business men- There am as many in the 
population who are less intelligent than the average semi-skilled workmen in 
the occupations above as there are of those who arc more intelligent. Every per- 
son who wishes to obtain norms or standards representative of the total popula- 
tion can do no better than to consider carefully the occupation groups in which 
he proposes to do his testing. There seems to be no better ready criterion. 

Another important study based on the Army material, and appearing in 
the same year as Symonds’ article, is Brigham’s analysis of the mental levels 
in the various racial groups. 27 From the point of view' of this research the 
interest lies not so much in the actual findings for the different racial groups 
then compared, as in Brigham’s later criticism of the work and of the 
Army Alpha Test in general. 8 * This criticism is twofold, the first point be- 
ing that a tetrad analysis of the sub-tests show’s disparate group factors and 
the second that scores in different sections of the Alpha Test were derived 
from different sub-tests. For these reasons Brigham concluded that the 

*®P. XL Symoadt, A Second Approximanon to the Curre of the Krtn*nJ»« of Intelli- 
gence of the Population of the United Stater, with • Safe on the Standardisation of the 
Binet-Simon Scale, f. Edue. P.ycbol, 19JJ. l«. 65-81. 

* T Brigham, A Study of American l xtcliigence. 

** Brigham. Intelligence Tent of Immigrant Ceompt, Pijdiel. Rtr, *93®. 37. I5S"*®3- 
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scores in the eight sub-tests should never have been added to obtain a total 
score. 

The reaction of the Army examiners to the fact that the different sub-tests did 
not contribute proportionately to the score at different levels was just the reverse 
of Brigham’s . 20 They stated: 

The alpha examination itself is not a homogeneous scale, but a composite 
of eight short scales, each much more nearly homogeneous than the composite 
total alpha. Alpha must be treated as a composite for two reasons: (t) Certain 
of its component tests are very much more difficult than others, so that ex- 
aminees of less than average intelligence do not in general register their abil-' 
jty In all of the tests. (2) Owing to the fact that the total time allowed for the 
alpha examination is rigidly apportioned among the eight tests, . . . the col- 
lection of the eight tests into an “examination” is mainly a matter of admin- 
istrative convenience, with relatively litde implicau'on of quantitative or quali- 
tative equivalence. It follows from what has just been stated that subjects who 
are able to score in all of the alpha tests obtain total scores that are not com- 
parable with total scores earned by individuals who failed in some of the tests. 
The reason for this difference is that in obtaining an individual’s total scoie by 
adding together the scores in each test, scores of 0 are treated arithmetically 
the same as other scores. . . . 

The fact that zero points are disposed at different levels of intelligence For 
different tests would be of no consequence if negative scores could be regis- 
tered; but since they can not be, the individual who fails to earn a positive 
score and is marked zero is actually thereby given a bonus varying in value 
directly with his stupidity. 

It should be admitted that in general practice, similar scores have been taken 
to represent similar performances on the tests, but it is difficult to see how this 
situation could invalidate the constants of a frequency distribution obtained 
from large numbers of cases. 

As to Brigham’s first criticism, that there are disparate group factors in the 
test, one may point out that “g” from a statistician’s point of view and general 
intelligence from a clinician’s point of view are different concepts. Brigham 
stated the situation clearly when he wrote : 30 

Factors have figured so largely in the work of Spearman, Kelley, and others 
that it is important to understand their nature. Factors are not to be regarded 

** Mem. Nat. Acad. Sd., 1921, 15. 621 

In determining interrelationships between the tests, the effects of the piling up of zem 
sawes in Certain of the sub tests were discounted by statistical procedures which rnhstnbuted 
these case* to accordance with the normal correlation surface This procedure, of course, Ad 
not adjust the total score for any individual. 
so Brigham, A Study of Error, 3S. 
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as psychological realities, or existences, but merely as certain decimal multi- 
pliers which in a gi\ en system might be used to account for certain correlation 
coefficients. One should hunt for factors in the realm of roots, relations, and 
postulated existences and not in the central nervous system or the intrcspecthe 
report. The materials from which /actors are derived are coefficients of correla- 
tion between tests. 

The flaws in the Alpha Test and the difficulties in its interpretation are 
not sufficient ground for throwing the Army results out of court. They are 
not suited to many of the types of analysis one might wish to make, but 
they still represent the most satisfactory, and indeed the only index of the 
intelligence of the American population at large. The importance of the 
Army Alpha has recently been emphasized by Wells, who revised the test 
for the Psychological Corporation.* 1 He expressed his belief that the old 
norms were still the best available and so arranged his revision that the new 
test would be directly comparable to the old. His views on the character of 
the test are evident in the following statements: 

The numerous and active progeny which have issued from Army Alpha dur- 
ing its decade and a half of existence, may Bad the informed reader somewhat 
puzzled as to the reason for its rejuvenation. The reasons center around the con- 
siderations that for all its technical defects it is still much the most widely stand- 
ardized of '‘intelligence'’ tests, depending for this fact on considerations which 
it is to be hoped will never be reproduced; it is also the most widely known, if 
not always the best understood with reference to superior educational and voca- 
tional groups, and the only one readily available in several alternate forms of 
substantially equal difficulty. . . . 

The evidence is striking that the relative difficulties with the Army Alpha, 
even in its present cumbrous and incongruous, often laughter-provoking state, 
are sTxU those of administration rather than of meaning. 

4 - INTELLIGENCE TESTS IN INDUSTRY 
The field of industrial psychology has much to contribute to a knowledge 
of certain skills, types of behavior, and special abilities at adult levels, but 
Jess to contribute on the problems of sdnlr intelligence. Except for a abort 
period just after the War, intelligence tests have been little used. 51 They 
have been shown to be of value in establishing the limit of intelligence be- 
low which success in the particular occupation is decidedly improbable, and 

* 1 F. I- Wdls, Army Alpha. Kerim?, Person- J, *932, 10, 411-417. 

11 A. W. Korahiiaei lad F. A. Kingsbury, Psychological Tests is Bsuinrs*, >924. 
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in some cases they have established a limit above which interest and satis- 
faction in the job are unlikely . 33 Except perhaps in clerical occupations they 
have shown low correlations with success in work. On the whole their uses 
have not been of sufficient immediate value to lead to extensive surveys, 
and the findings available are on small groups, selected in one regard or an- 
other, and not satisfactory material from which to draw conclusions as to 
the level or characteristics of mental functioning in the adult. 

The wide range of test intelligence found among individuals in one type 
of occupation was evident in the Army data. Fryer’s study of this material 
summarized the findings for ninety-six occupations, giving interquartile 
score ranges as well as median Army Alpha scores for each . 34 A similarly 
wide range has been found in a recent investigation by Cattell, in which he 
attempted to standardize an intelligence test for adults . 35 He listed twenty- 
five occupations from which he had examined 12 or more members; in spite 
of the fact that the numbers in some of these groups were small and that his 
test 38 differed somewhat in nature from the Alpha, there is a fairly close cor- 
respondence between his listing of occupations in order of intellectual level 
and Fryer’s listing. 


5. SEX DIFFERENCES IN INTELLIGENCE TEST PERFORMANCES 

From the time of Thompson’s 57 pioneer work until recent years, almost 
all studies of sex differences in intelligence had been based on the findings 
for children. Garrett summarized these as follows : 38 

Most investigators have found the average boy to be superior to the average 
girl in tests involving numerical or mathematical relationships, as well as in the 
ability to employ spatial and geometric concepts. Girls, in turn, are usually su- 

** For studies in this field, sec: 

K. M Cowdcry, Measures of Genera! Intelligence as Indices of Success in Trade Learning, 
J. Appl. Psycho! , 1911, 6, 31 1-330. 

E. Bun, Minimum Intellectual Levels of Accomplishment in Industry, J Person. Res 
1913. 3. 207-aia. 

For a summary dueuMum, sec M, S Vtfdes, Industrial Psychology, tti-tij. 

** D Fryer. Occupational-Intelligence Standards, Sch k Soc, 1912, 16, 273-277, 

88 R. B. Cattell, Occupational Norms of Intelligence and the Standardization of an Adult 
Intelligence Test, Brit. J. Psychol., Gen Sec, «934-35. *5. 1-28. 

84 The sub-tests were synonyms, classification, opposites, completion, and inferences the 

types of performance which had, in Cattell'* opinion, been shown to correlate most hiehfv 
with "g" without overlap. ® 7 

88 H, B. Thompson, The Mental Traitt of Sex, 1903. 

88 H. E. Garrett and M. R. Schneck, Psychological Tests, SfrtWj, and Remits, Part II 198 
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pcrior to boys in tests requiring memory and in the ability to employ language 
relationships quickly and precisely ... the consistency of the results reported 
by different investigators lends weight to the belief in the existence of a true 
difference. 

In a study of over 2,000 school children Pressey found that the girls' lead 
in verbal material was gradually lost with age, but the boys were superior in 
arithmetic at all ages. 25 A numlier of other studies have shown, however, 
that the girls’ superiority in verbal material persists at the college level. 

The most conclusive recent evidence is that from the results of the Scho- 
lastic Aptitude Test. Each year since the mathematical section was added, 
the girls have been superior in mean scores on the verbal and .the boys on 
the mathematical portion of the test. 40 In most years the population was 
about 8,000 and the differences in means were clearly significant. It is to be 
noted that Brolyer considers the group of girls more highly selected than 
the group of boy's, a condition which would tend to produce a difference 
smaller than the true one in mathematical material and larger in verbal. 

Evidence of reliable differences is not so dear among adults. In his study 
of family resemblances, Carter found that sex differences in adult scores 
were slight in comparison with individual differences.* 1 It is to be noted, 
however, that the difference divided by the sigma of the difference for the 
computation tests, on which the men surpassed the women, was 7.74, while 
this ratio for the vocabulary test, on which the women were superior, was 

I.OO. 

Conrad and Jones, whose important study of adult intelligence has al- 
ready been mentioned, found slight superiority on the part of the females; 
it was greater during adolescence than in later years but in their opinion not 
sufficient to be of any practical importance. 42 The males were rather con- 
sistently inferior on those four tesrs of the Army Alpha which the authors 
thought most “strongly verbal” (common sense, opposites, disarranged sen- 
tences, and analogies). The differences on the other tests arc reported in the 
following statements: 43 

** I- W. Passey, See Difference* Sboars by 3.J-H School Children pa a Croup Scale of In- 
telligence, with Special Reformer to VanabUay, J. Appl. Prycbol.. J 9 ‘S, 2, 333-340. 

49 C. R. Brolyer, Ftfth-SiKlh Reports of the Commission on the Scholastic Apntvde Test, 

1930-1934- 

41 H. D. Carter, Family Resemblances in Verbal and numerical Abiltties, Genet. Psychol. 
Mooog n 193a, is, 1-104. 

41 Op ct.. 365-1 66, 

4 * Conrad, Jones, and Hsiao, Sec Differences in hfer.tsl Growth and Decline, J. Ednc. 
Psychol, 1933, 34, 166. 
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In test I (oral directions) the males are in early adolescence as inferior as in 
the more highly verbal tests; but at later ages they acquit themselves relatively 
much better, overtaking the females in the period of middle maturity, and again 
dropping slightly behind in later maturity. Test 8 (information) is one which 
has commonly been regarded as favoring males. . . , Our obtained sex differ- 
ences are relatively slight in this test, and with frequent intersections of the 
growth curves. In test 6 (number series completions) a similar condition occurs. 
. . . [In 2) (arithmetic problems) wc find the only instance in which males are 
fairly consistently superior to females. The male superiority does not clearly 
emerge until later adolescence. . . . 

Small differences, varying in direction according to the nature of the ma- 
terial arid dissimilarities in interest and practice, were also found for the 
moving picture test which Conrad and Jones reported. 44 

Miles and Miles discovered irregular trends in the average scores for men 
and women of successive decade groups on a fifteen-minute form of the 
Otis Self-Administering Tests of Intelligence, but they were inclined to re- 
gard the differences as “possibly due to errors in sampling.” 45 The men 
were superior in most of the age groups. "When comparisons were made 
between five large-range age groups, the men were superior in four in- 
stances and in two of these the differences were great (with a difference 
divided by the sigma of the difference of 3.58 for ages fifteen, to twenty-nine 
and 2.61 for ages fifty to sixty-nine). It is to be noted that these comparisons 
are based on an omnibus test, and differences between the sexes in particu- 
lar types of performance, such as Jones and Conrad found, may thus have 
been hidden. Other parts of the Stanford Maturity Study, that is, investiga- 
tions of other activities besides those involved in the Otis Test, have shown 
few significant or large sex differences, however; and most of these investi- 
gations are of specific types of activity in which differences between groups 
should have been detected if they had existed 48 

6. THE GROWTH AND DECLINE OF INTELLIGENCE 

The attention of psychologists, particularly of those concerned with the 
construction or interpretation of mental tests, has long been focused on the 
problem of the growth of intelligence 47 It has been attacked by comparing 

41 Conrad and Jones, Psychological Studies of Holton Return V. Adolescent and Adult Sex 
Differences in Immediate end Delayed Recall, J. Soc. Psychol., 1931, 2, 433-459. 

41 Op at , 64. *® See pages 31-32. 

47 For discussion of the earlier contributions, see F. N. Freeman, Mental Tests, igafi, chap. 
XIII, and F. D. Brooks, The Psychology of Adoleseence, 1939, chaps. IV and V. 
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the test performances of different groups of children at successive ages, 
and by retesting the same groups of children at an interval of one or more 
years. There has been careful work on sex differences in mental develop- 
ment, on the relations between physical and mental growth, and on many 
other aspects of the problem. The question of the limit of mental growth 
has of course been a part of the developmental studies of children, but 
until recently no studies were continued through successive ages beyond 
adolescence. Consequently, it was never possible to determine the actual 
course of any test performance with age. The only data available were the 
Army findings, with different age groups which were probably not com- 
parable, and a small number of studies of special groups of adults — fire- 
men, policemen, prisoners, professors, and various other types of employed 
and unemployed. 48 None of these data demonstrated the peak of develop- 
ment or the rate or characteristics of the decline beyond maturity. 49 The 
first important work on these problems began in the late i^o’s with Thorn- 
dike’s experiments on adult learning, Willoughby’s study of family resem- 
blances, and Jones and Conrad’s investigation of the relation between age 
and test performance in a large group of adults. The results of this work 
must be reviewed in some detail, as well as the findings of Miles’ Stanford 
Maturity Study. 

•• Most of these studies arc on selected groups, which are difficult to locate in the population 
at large and will not be discussed in this chapter. If a sufficient number of such groups were 
studied by comparable methods, they would, of course, be valuable as component parti of a 
larger population which could be weighted to approximate that of the census figures. 

For examples of these special studies, the reader is referred to the following articles: 

H. H. Caldwell, Adult Tests of the Stanford Revision Applied to University Faculty Mem- 
bers, J. Exper. Psychol, 1912, 5, 347-36*. 

R- Pintner and A. Toops, A Mental Surtey of the Population of a Ti'or\house, J, Delinq, 
1917.2,275-287. 

I- M. Terman, et al, A Trial of Mental and Pedagogical Tests in a Cud Sertice Examina- 
tion for Policemen and Firemen, } Appl. Psychol , 1917, I, 17-29. 

F. D. Fry, The Correlation of the Reverse Audito-Vocal Digit Memory Span with the 
Crreral Intelligence and other Mental Abilities of 308 Prisoners of the Eastern State Peni- 
tentiary of Pennsylvania, Psychol. CUn, 19JI, 19, 156-164. 

48 There were a few early studies of senile decline, notably those by Beeson (Intelligence at 
Senescence, J Appl. Psychol, 1930, 4, 319-254) and Foster and Taylor ( The Applicability of 
Mental Tests to Persons over Fifty Years of Age, J. Appl. Psychol , 1920, 4. 39 5®? • 
application of the findings has been limited by the fact that the groups were selected. Intcrest- 
ingfy enough G. Stanley Hall did not mention these two papers in his large volume on Senes- 
cence m 1922, perhaps because be did not consider the existing mental tests applicable to 
adults. He wrote: "There are no mental tests of generally recognized validity above the teens, 
so that we have no criteria for determining psychological age for even the elderly ... 

(page 196). 
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Before discussing the data on mental growth or decline, however, it is 
first of all necessary to note that any findings must be interpreted in the 
light of the type and difficulty of the material the test presents and of the 
scaling method employed. The results should be determined separately for 
different types of test, and the test should always be suited to the mental 
maturity of the subject. The scaling method must, of course, be independ- 
ent of assumptions as to the nature of mental growth. 

The mental-age method of scaling, with the linear relationship between 
mental and chronological age, is therefore unavailing in the study of men- 
tal growth. The early investigators, when they did not report results in 
terms of mental age, reported raw scores. Thurstonc and Ackerson, 50 in 
connection with a report of Binct Tests scaled by Thurstone’s method of 
absolute scaling 51 note that the raw score is also an unsatisfactory unit of 
measurement for it is probably not the same in different parts of the scale. 
The studies based on a method of absolute scaling are few, however, and 
of comparatively recent date. 

One of the most widely quoted early studies is Teagarden’s. 52 She tested 408 
children over twelve and a half years of age at Mooseheart, a home maintained 
by the Loyal Order of Moose for orphans of its deceased members. She consid- 
ered the group low-average in relation to the population at large, but relatively 
unselected up to eighteen years. Of the results reported, those for the Stanford 
Binet Scale are unsatisfactory because of the mental-age scaling, but those for the 
Army Alpha furnish interesting material for further analysis. Teagarden used 
raw and T-scorcs for each half-year group, preferring the T-scorcs although the 
results for the raw and T-scores did not differ markedly. Sigmas were also given 
for the raw scores, however, so that these are more satisfactory for analysis. 

Teagarden used the average-of-three method of smoothing her curves and 
thus eliminated the irregularity of the progression from one half-year group to 
the other which is evident when the senes of medians is surveyed. 63 None of the 
differences between medians is statistically significant. By taking twelve instead 
of six months as the unit, so as to increase the number of cases in each group, 
the irregularities may be eliminated and the general trend brought out more 

sol, l. Thurstonc and L Aciersoa, The Mental Gratvth Curve for the Binct Tent, J. 
Educ. Psychol , 19 J 9> 569-583- 

81 Thurstonc, A Method of Scaling Psychological and Educational Tests, J Educ Psychol, 
1915, 16, 433 - 45 *. 

ea F. M. Teagarden, A Study of the Upper Limits of the Development of Intelligence, 
Teach. Coll. Conui to Educ., *914* No - *56 

88 If the distribution* were, as Teagarden believed, approximately normal, means and me- 
dians would not differ appreciably 
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the test performances of different groups of children at successive ages, 
and by retesting the same groups of children at an interval of one or more 
years. There has been careful work on sex differences in mental develop 
ment, on the relations between physical and mental growth, and on many 
other aspects of the problem. The question of the limit of mental growth 
has of course been a pan of the developmental studies of children, but 
until recently no studies were continued through successive ages beyond 
adolescence. Consequently, it was never possible to determine the actual 
course of any test performance with age. The only data available were the 
Army findings, with different age groups which were probably not com- 
parable, and a small number of studies of special groups of adults— fire- 
men, policemen, prisoners, professors, and various other types of employed 
and unemployed. 49 None of these data demonstrated the peak of develop 
ment or the rate or characteristics of the decline beyond maturity. 49 The 
first important work on these problems began in the late r 920*5 with Thorn- 
dike's experiments on adult learning, Willoughby's study of family resem- 
blances, and Jones and Conrad’s investigation of the relation between age 
and test performance in a large group of adults. The results of this work 
must be reviewed in some detail, as well as the findings of Miles’ Stanford 
Maturity Study. 

45 Most of these studies are on selected groups, which are difficult to locate in the population 
at large and will not be discussed in this chapter. If a sufficient number of such groups were 
studied by comparable methods, they would, of course, be valuable as component parts of a 
larger population which could be weighted to approximate that of the census figures. 

For examples of these special studies, the reader is referred to the following articles: 

H. H. Caldwell, Adult Tests of the Stanford Revision Applied to University F acuity Mem- 
bers, J. Exper. Psychol., jgaa, 5, 347-361. 

R. Puitner and A. Toops, A Mental Sun ey of the Population of a Workhouse, J. Delinq, 
1917, 3, 278-387. 

L. M. Term an, ct a!., A Trial of Mental end Pedagogical Tests in a Cnil Sen ice Examina- 
tion fir policemen and Firemen, J Appl. Psychol., 1917, 1, 17-29 

F. D. Fry, The Correlation of the Reverse Audito-Vocal Digit Memory Span with the 
General Intelligence end other Mental Abilities of 30S Prisoners of the Eastern Stale Peni- 
tentiary of Pennsylvania, Psychol. Clin., 1931, 19, 156-164. 

48 There were a few early studies of senile decline, notably those by Beeson ( Intelligence at 
Senescence, J. Appl. Psycho!., 1920, 4, 319-234) and Foster and Taylor ( The Applicability of 
Mental Tests to Persons over Fifty Tears of Age, J. Appl. Psychol , 1920, 4, 39~sW* ^ ac ^ te 
application of the findings has been limited by the fact that the groups were selected. Interest- 
ingly enough G. Stanley Hall did not mention these mo papers in fus large volume on Settef- 
cence in J922, perhaps because he did not consider the existing mental tests applicable to 
adults He wrote: "There are no mental tests of generally recognized validity above the teens, 
so that we have no criteria for determining psychological age for even the elderly . • . 

(page 196). 
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1933, J 1 Method °f Sealing Psychological and Educational Tests, J Educ. Psychol , 
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dearly. 6 4 The new figures are shown in Table I. The reliability of the differences 
between the medians (here treated as means) for succeeding years is also given. 

The difference between the medians which most nearly approaches statistical 
reliability Is the rise between the groups with midpoints at fifteen and sixteen 
years. There is a slight rise in median score between sixteen and seventeen, but 
none at all between seventeen and eighteen. 63 These figures would indicate an 
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earlier cessation of growth than Teagarden reported, a cessation between sixteen 
and seventeen rather than between seventeen and eighteen. They wo so in 
dicate a positively accelerated curve between thirteen and sixteen, vv ereas ea 
garden concluded that this curve was negatively accelerated. It is har to say o 
much of the negative acceleration she found depended on the average-o r 
method of smoothing the curve. 

The analysis Thurstone and Ackcrson made of the results of Binet Tests 
scaled by the method of absolute scaling showed a positively acc erate 
curve up to the age of about eleven years with negative acceleration ere 


** Age groups above eighteen were omitted because they were small and probabl y se! « 

« Ai analysis of the sLford Binet along the same lines as tins analyst* 
some doubt on Teagarden’s conclusion that “the stopping place in the cur ’’* ** occurs a 
in Alpha than in the other tests” (page 6a), since the most reliable nse in 
year earlier than that in the Alpha. 
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after. 80 The population they studied was slightly retarded, and more varia- 
ble than the average; the subjects were 4,208 white children, three to seven- 
teen years of age, entering the Institute lor Juvenile Research. The growth 
curve obtained was asymptotic to absolute zero and asymptotic to the level 
of test maturity. The authors noted that they had of course no rational pro- 
cedure for locating the level of maturity according to the Binet Test as for 
locating the absolute zero. They found that the brighter children reached 
the inflection point of about eleven years sooner than the dull, and that they 
reached test maturity sooner than the dull. The authors stated that: "The 
absolute variability of test intelligence increases with age until adult intelli- 
gence is attained.” 

Richardson and Stokes, on the basis of results obtained for the Simplex 
Junior Intelligence Scale given to 12,000 children of ages six to fourteen, the 
total population, between these ages in one town in England, supported 
Thurstone’s conclusions as to variability. 51 They found that the absolute 
variability of intelligence increased with age, and that the spread of intelli- 
gence at any age was proportional to its mean level at that age. Considera- 
tion of the growth curve indicated that the sigma divided by the mean was 
a constant equal to about .18. The mean growth curve obtained was an al- 
most perfect G curve, with the inflection point at about four years. 83 Rich- 
ardson noted that "from 6 years onward we are dealing with the upper 
half of the developmental curve which has reached by n years a level of 
over 80 of maturity. Hence the differences we are accustomed to meas- 
ure with our tests, between one year and another, and one individual and 
another, will often be quite small when viewed against the background of 
the absolute scale.” He speculated as to the probable form of the growth 
curve below and above the ages examined, and suggested with regard to 
the upper end that development probably reached 96 per cent by eighteen 
years and 99 per cent by twenty-four. 

Another study by means of the method of absolute scaling has been car- 
ried out by Odum. 83 His data were also very extensive and based on the re- 

*• Op. a t . 569-583. 

81 C. A. Richardson and C. W. Stokes, Tie Growth and Variability of Intelligence , BnL J. 
Psychol., Monog. Suppl., 1933, 18, 1-83 

E ® The growth curves for the separate items were also G curves, but most of them showed a 
slight kink somewhere between nine and a half and thirteen and a half 

89 C. L. Odum, A Study of the Mental Growth Curve with Special Reference to the Results 
of Group Intelligence Tests, J Educ. Psychol , 1929, so, 401-416. 

Different tests were used at different ages, the total senes including the Dearborn Scale, 
Series I and II; the Ons Group Scale, Primary and Advanced; the Illinois Group Intelligence 
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suits of group tests. His population ranged in age from five to nineteen 
years. Odum found that the growth curve showed negative acceleration, 
and that the variability in the case of each test increased as chronological 
age increased. He concluded that the ability of children to score on group 
intelligence tests did not “stop growing before the age of 17, and very likely 
not until a later age.” 

The crucial ages from fourteen to eighteen have been the subject of spe- 
cial study as a part of Thorndike’s work on mental growth. In 1922 and 
1923 he gathered material on retests of 8,000 high school students, grades 
nine to eleven and ages fourteen to eighteen, and, after allowing for the efi 
feet of practice, found evidence of a steady increase in score from age to age. 
He concluded that any decrease in gain with age was offset by increasing 
selection of the group/ 50 To study the influence of age freed from the influ- 
ence of selection, he later compared the gains made in a year by a large 
number of thirtccn-ycar-olds from the ninth grade and fourteen-year-olds 
from the tenth, the basic assumption being that about the same amount of 
ability was required to be in the tenth grade at fourteen as to be in the ninth 
at thirteen.® 1 The results showed a slightly larger gain in the thirteen-year 
group. 

Thorndike is one of those who have most emphatically contended that 
mental growth as indicated by test performances continues beyond adoles- 
cence. In The Measurement of Intelligence, he stated : w 


Neither in our CAVD results, nor in the National-Ocis-Haggerty estimate, nor 
in Brooks’ results is there any justification for the doctrine that the gain in u- 
tude of intellect of the sort measured by existing intelligence tests is zero after 14, 
or after 15, or even after 16. It decreases, but it should not become inapprecia e 
until 18 or later. According to our results the decrease from 14 to 18 is not an 


Scale; and the National Intelligence Test, Scale A. The numbers taking each of these tests 
ranged fiom 2.000 to 40,000. , 

M E. L. Thorndike, On the Improvement in Intelligence Scorer from Fourteen to Eig e . 
J. Educ. Psychol., 1923, 14, 313-516. . . . 

Thorndike, On the Improvement in Intelligence Seoret from Thirteen to Nineteen, J- 
Educ. Psychol , 1926, 17, 73-76. 

al Thorndike, The Measurement of Intelligence, 467. . . 

It is to be noted that Thorndike suggested that there was a "differentia! gain from e S 
of fourteen to twenty and beyond, whereby some individuals increased these abilities ay 
greatly, whereas others increased them little or not at all." On this basis he wou accoun 
the change from symmetry and normality characterizing the distributions of testmte 
for children to the positive skewness shown in the distribution of the sort of utte mea 
by Alpha for the adult population. (Page 293.) Sec page 8. 
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abrupt slowing up of a gain that has been steady hitherto, but is part of a gen- 
eral negative acceleration which began long before the age of 6 '/j. 

The type of curse Thorndike described continued to rise longer than 
that found by Thurstone, a difference which may well depend on the difi 
fcrcncc in discriminative power between the Binct and the CAVD tests for 
the older subjects. At the younger ages Thorndike’s curve is not unlike that 
obtained by Richardson and Stokes: both showed positive acceleration in 
the very early years, followed by negative acceleration. Thurstone would 
place the inflection point later, about eleven years. Odum found no inflec- 
tion point between five and nineteen and described a negative acceleration 
throughout this age period. 

Thorndike’s studies of learning also indicated a limit of growth which 
fell considerably beyond early adolescence. He set the age somewhere near 
twenty.** As to the characteristics of learning in the years beyond twenty, 
he concluded: 0 * 

The general tendency from all our experiments is for an inferiority of about 
15 per cent as a result of the 20 years from twenty-two on. Learning representing 
an approximation to sheer modifiability unaided by past learning shows consid- 
erably more inferiority than this. Actual learning of such things as adults com- 
monly have to learn shows considerably less. . . . 

Adults may be expected to learn harder things than they could have done in 
the years from six to sixteen and to learn the same things in ways requiring 
greater powers of abstraction and reasoning. 

One of the first studies of adult intelligence after the Army survey was 
the investigation Willoughby made in 1927, which had as its immediate 
purpose an analysis of family resemblances. 05 The families selected for ex- 
amination were those having children aged thirteen in the schools around 
Stanford University. The territory is probably a superior one, and a further 
factor in the selection of the group is that almost 50 per cent of the families 
refused to cooperate; these may have included a larger proportion of dull 
subjects. While the test levels probably should not be taken as representative 
of the population at large, the findings on the course of various test per- 
formances with age are of great importance. The peaks of ability on the dif- 
ferent tests ranged from seventeen to twenty-seven years: 

Thorndike, AJuIt Learning. 129 _ * 4 IM , I06 and 154. 

** R, R. Willoughby, family St/nilanu'ei in Mental-Test Abilities. Genet. Psychol. Monog , 
*9*7> a. 339-277. 
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Type of test 

Source of test 

Pca\ of alilitf 

Vocabulary 

Stanford Achicx ement 

25 

Science-nature information 

Stanford Achievement 

25 

History-literature information 

Stanford Achievement 

22 

Number series completion 

Army Alpha and Beta 

IS 

Form combination 

Army Alpha and Beta 

19 

Digit-symbol substitution 

Army Alpha and Beta 

x8 

Arithmetic reasoning 

National Intelligence 

20-J- 

Opposites 

National Intelligence, with the 



addition of harder items 

27 

Symbol series completion 

National Intelligence, with the 



addition of harder items 

17 

Verbal analogies 

National Intelligence, with the 



addition of harder items 

18 

Comparison 

National Intelligence, with the 

* 


omission of pictorial elements 

23 


It is evident that the tests showing the earliest peaks were the number and 
non-verbal tests and also the analogies, while those showing the latest peaks 
were tests of vocabulary and information. 

The tests were differentiated not only with regard to peak of develop- 
ment, but also with regard to two other characteristics of the growth 
curves: the extent of senescent decline and the sharpness or roundness of 
the peaks. 08 A decline which carried the curve back to pubertal or prepu- 
bertal levels appeared in the case of the opposites, the number and the 
symbol-series completion, the analogies, the symbol-digit substitution, the 
history-literature information, and the comparison. There was less decline 
for the vocabulary, the form-combination, and the science-nature informa- 
tion tests, and least of all for the arithmetic reasoning. The two tests show- 
ing the sharpest peaks were number-series completion and analogies, while 
that showing the roundest was arithmetic reasoning. In this connection 
Willoughby suggested that the sharp peaks were probably to be found for 
test abilities “acquired more or less as a matter of normal growth and com- 
paratively independent of school training, while a long, rounded peak 
would seem to be the result of laborious and long-continued effort against 
obstacles.” 

The next important study of adult intelligence after Willoughby’s was 
that made by Jones and Conrad in New England. 61 It has already been de- 


1 Willoughby'* curves are continued to sixty years. 
1 Op. rit., 323-298. 
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scribed as unique among studies of adults because it represents a survey of 
a complete or almost complete community. Possible adult resistance to 
mental testing was overcome by propaganda and the offer of a free movie. 
Cases obtained in this way were supplemented by house-to-house testing. It 
is possible that there was some selection in the age groups over forty, for 
there was an increasing number of refusals beyond this age, but on the 
whole the population is one of the most satisfactory ever obtained, and one 
of the few providing trustworthy indices of the typical adult test perform- 
ances at different ages. 68 The only tests given, however, were a test on the 
movie and the Army Alpha, and consequently the extent of the data on 
adult test performances is rather limited. 

The general characteristics of the growth curve on the total Army Alpha 
for thi 1,191 subjects, aged ten to sixty, were found to be “linear growth to 
about 16 years, with a negative acceleration beyond id to a peak between the 
ages of 18 and 21.” 00 Following this peak there was a decline "which is 
much more gradual than the curve of growth, but which by the age of 55 
involves a recession to the 14-year level.” 70 The difference between test lev- 

** The figure* are, or course, representative only for a geographical area similar to the one 
surveyed, where about 87 per cent of the population came from old New England *tock. 

The occupational distribution was similar to that of the Vermont Census for 1910; the 
agricultural groups however, were mote heavily weighted in Jones’ sample 

"Jones and Conrad noted that this growth curve was similar to that reported by Tea- 
garden and discussed above. They found close agreement in the actual temo up to mtcen. and 
suspected that Teagarden’s groups were selected be; old this age (Op cir . 239, 241 ) 
tM , 239. 

Hsiao reported the xam- dan in an M A. thesis at Columbia in 1927. Performance of Army 
Alpha at a Function of Age. There are some discrepancies between his dara and those given 
by font* and Conrad The number of cases was the same for the two studies in only two age 
group*, the mediant the tame in only three There are differences in the growth curve* which 
depend on the (act that Hsiao give equal distances on the base line to each step to hit distribu- 
tion, while Jones and Conrad allowed for the unequal steps Furthermore, Hsiao smoothed hi* 
curses by the a\erage-cf three method, whereas Jones and Conrad took as original data the 
average of the mean and the median and then smoothed the curves by eye. 

Hsiao reported the reliability of the difference* between total Alpha score* in successive age 
groups. The differences between the successive year groups from eleven to sixteen are nearly re- 
liable in four instances and reliab’e in one, that between thirteen and fourteen. In the adult 
period, none of the differences is reliab'e, but the decline for the group at forty five to forty- 
nine as compared with that at forty to forty four is almost reliable There are rises, although 
neither is reliable, in the semi-decade thirts five to thirty-nine and again in the semi-decade 
forty to forty-four. In all the sub-tests except Test 7 the medians for the age range forty to 
fbctv-fbur are higher than those for the age ranee thirty-fire to thirty-nine. 

The rise in the period forty to forty four also appeared in MurseJl* examination* of pris- 
oners by the Kuhlmann-Andersoa and Ku.Mms.nn Binet Tests (Decreate is Intelligence «eiM 
Inerrate in Age among lnna‘et of Feral Inmtanone J Juv Res, 1919, 13, 197-203), As in 
the study by Jones and Conrad the me was followed by a sharp drop after forty five yean. 
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els in the different age groups led Jones and Conrad to make the following 
significant statement: 

A distinction . . . needs to be made between peak of development, and the 
“mental age of adults.” The latter phrase is as a matter of fact no more justifiable 
than would be the expression the “mental age of children.” In both cases (with 
possibly rare exceptions) the mental age changes significandy with the progress 
of the years: among the children there is growth; among the adults decline. 71 

Consequently they gave no median mental age for the adult group as a 
whole 72 The T-scorcs were, however, based on the variability within an 
age range of fifteen years, namely twenty-five to thirty-nine, and thus sig- 
nificant growth or decline within this period were in effect excluded. 

The growth curves for the separate sub-tests of the Army Alpha showed 
considerable diversity. The most rapid declines appeared for Tests 7 
(Analogies), 3 (Common Sense), and 6 (Number Completions). Jones and 
Conrad suggested that these should probably be considered the best tests of 
the Army Alpha as measures of basic intelligence, that is, the tests most 
free from the effect of environmental variables. On this point they found 
themselves in accord with Thorndike’s conclusion that “age exerts its most 
adverse influence upon native capacity or ‘sheer modifiability.’” Test 1 
(Oral Directions) showed a sudden, sharp drop from the early peak, but 
thereafter reached a level which was well maintained till sixty. Tests 2 
(Arithmetic Problems) and 5 (Dissected Sentences) showed a gradual but 
not marked decline with age. The two tests manifesting little or no decline 
with age were 4 (Opposites) and 8 (General Information) . The authors 

71 Jones and Conrad discussed at some length the additional factors which might influence 
the declining curves. ( Op . at , 254-157.) They considered the possibility of slighter motiva- 
tion on the part of the older subjects, poor eyesight or hearing, the remoteness of schoot train- 
ing, and disuse of functions, an inability to work rapidly, and a difficulty in understanding 
questions. Comparisons of the cases tested in the meeting halls and the cases tested at home, 
together with comparisons of the findings on different types of tests, some of them Jess influ- 
enced than others by school training, speed, etc- seemed, however, to demonstrate that the 
declines depended not on any of the above-mentioned "factors, bnt on increasing age. 

7 * While the median score on Army Alpha for the total native-white draft was 59. the 
median for the 802 cases from Vermont taking Alpha only was 67. for the 710 cases from New 
Hampshire 62, and for the 1,134 from Massachusetts 7a. (Mem. Nat. Acad. Sci, 1921, *5, 
682-683.) It is natural, therefore, that Jones* medians should be slightly higher than the total 
median, but it b surprising that they should be so much higher; for his groups over thirteen 
years otd the medians ranged from 71 for the age period fifty to fifty-four to 93 for the age 
period nineteen to twenty-one. The sample is such a complete one that the higher medians 
must indicate either a community group which is actually superior to the population at large or 
better testing conditions or both. 



THE GROWTH AND DECLINE OF INTELLIGENCE 25 
suggested that these tests, because of increasing experience, should really 
show increased scores with age; in other words, that the sustained rise was 
not of the same significance as a sustained rise in tests of more clearly 
"basic intelligence” would be. 73 

A comparison of the curves obtained by Willoughby and by Jones and 
Conrad for tests similar in type of performance required shows certain dif- 
ferences in peak of development and rate of decline but no marked dis- 
parities. In the study made by Jones and Conrad there is a sharper decline 
for analogies and considerably less decline for opposites. In both, number- 
series completions manifest a definite drop while arithmetic reasoning and 
information are relatively well maintained. 

A summary of the data with regard to the peak of development, or the 
ages at which mean scores are highest, is presented in Table II. Whenever 
possible tests were grouped as verbal or non-verbal, and when no such dis- 
tinction was possible the test was classified as "doubtful.” 14 On the basis of 

T * Janes and Conrad stated that both of these were information tests, which should prob- 
ably not be included in a battery to be used ior groups of different ages. It seems probable 
that the difficulty is one of interpretation, that both are useful tests providing satisfactory 
norms are established for different age levels. 

Wilson recommended Test 8 (Information) as an adult test, (Information as a Measure of 
Intelligence and Maturity, J. Educ. Psychol, 1924. 15, 309-312 ) Conrad, using Hsiao's data, 
alio recommended it. ( General-Information , Intelligence, and the Decline of Intelligence, J. 
Appl. Psychol., 1930, 14, 392-599 ) Conrad argued that it was a test acceptable to adults, and 
suitable for them since their scores did not decline with age, and that it correlated fairly well 
with total Alpha scores. He pointed out, however, that an apparent increase in scores on Test 8 
may mean an actual decline, if the older patient shows less increment than his age itself should 
brirg about. It is to be noted, however, that Spearman found the information test unsuitable for 
intelligence testing (T he Abilities of Man, 270-277, 362), and that Wells deleted it in his re- 
vision of the Army Alpha. 

As to the continued use of opposites tests in the study of adult intelligence, there would 
probably be a greater number of ayes than nays. The test has generally been found to correlate 
highly with measures of so-called general intelligence. The Army examiners found the oppo- 
sites test of their first scale the best in the examination “as a measure of the traits which enter 
into a Stanford Biaet mental age." (Mem Nat Acad. So , 1921, 15, 33a ) A version of the 
opposites is one of the three sub-tests remaining in the verbal section of the Scholastic Aptitude 
Test 

The correlation between Test 4 and Test 8 ts the highest of the intercorrelarions of the 
Alpha sub-tests for the Army data While Test 4 has the highest correlation with Stanford 
Binet, Test 8 has only the fifth highest. The high correlation between them may possibly be a 
spurious one, resulting from their similar correlation with age. The fact that they have simi- 
lar age curves docs not, of course, indicate that they arc tests of closely related performances. 

T * Of the third group, the arithmetic reasoning test is clearly distinct from the verbal, and 
yet can hardly be classified with ihe tests which are in the non-verbal group. In view of 
Kelley's analysis of the factors in the Army Alpha, the directions test doei not seem to fit in 
with the other verbal tests, nor are the analogies tests altogether satisfactory ut that class. (Op. 
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the existing data it is difficult to judge to what extent the differ cnees In the 
ages at which the peaks appear depend on inadequacies in certain of the 
tests, and to uhat extent they depend on actual differences in the matura- 
tion rate of different abilities.* 9 The present results show that the peaks ap- 
pear later for most of the verbal test abilities than for the predominantly 
non-verbal, and that the analogies test, as well as the arithmetic reasoning 


TABLE II 

AGES AT WHICH HIGHEST MEAN SCORES ARE FOUND IN DIFFERENT 
TYPES OF TESTS (WILLOUGHBY AND JONES AND CONRAD) 


VI Bit At, 

Ten 

Age 

. NON-VERBAL 

Ten 

Age 

DOUBTFUL 

Ten 

Age 



WJUOUCHBY 




Oppoiite* 

27 

Numler-Seriet Couple- 


Arithmetic Rea »o rung 

30 

Vocabulary 

2? 

non 

18 

Ana'ogie* 

>8 

Science Information 

25 

Symbol Series Comple- 




Companion 


linn 

18 



llutory and Literature 


Form Combination 




Information 

31 

Digit-Symbol Substitu- 






twn 

18 





JONES AND CO VBA D 




Practical Judgment 

l8 



Direction 

21 

„ . \ 




Arithmetic Rea toning 






Analogic* 

16 

Directed Sentence* 






Information 

35-39 







and possibly the directions and practical judgment, are to be classed with 
the non-verbal tests in regard to the early age at which the maximum 
mean score appears. 

As to the data on variability Jones and Conrad reported that a differ- 
ential growth rate in adolescence was clearly indicated by increase in sigma 
not only for Alpha but for every sub-test. Except in Test 7 (Analogies), 
they found no evidence for a differential duration of growth. 

Although Thurstone and Ackcrson specifically limited the application of 
their law of absolute variability, stating that absolute variability of test »n- 

f * A further indication of differential gain appear* in a study of the growth of intelligence 
at the college levet by Rogers at Bryn Mawr. ( The Cron th of Intelligence at the College Level. 
Sch. & Soe., 1930, 31, 693-699.) She found the most marked gain in reading ability. 
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tdligcncc increased with age until adult intelligence was attained, 78 Jones 
and Conrad seemed to consider the extension of the law. They stated: 77 

It is interesting to note that fulfillment of Thurstone’s law of absolute vari- 
ability would require that the decline in intelligence-test scores be accompanied 
by corresponding decline in absolute standard deviations — implying more rapid 
decline of intelligence among the brighter than among the duller of the adults. 
Using raw Alpha scores, we have failed to obtain a general linear relationship 
between mean and standard deviation. Whether or not the same absence of a 
general relationship would be found with Alpha scores in “absolute” scaling re- 
mains to be determined. 

Certainly neither common sense nor the evidence from scientific studies 
would indicate that the senile decline began earlier in the more intelligent. 

The Stanford Maturity Study is, as already noted, somewhat less satis- 
factory than that of Jones and Conrad from the point of view of the 
sampling of the population. Subjects were secured through church groups, 
philanthropic societies, clubs, and lodges, and their services were remun- 
erated by payment to the organization rather than to the individual. 73 The 
method was ingenious in that it aroused group interest and brought in a 
much larger proportion of the total community than would have been se- 
cured if subjects had been solicited individually. It is probable, however, 
that the groups of adults who have joined themselves to any organization, 
with the possible exception of the church, represent selected samples. 79 

A much more ambitious program of testing was planned than that car- 
ried through by Jones and Conrad, for the purpose of the Maturity Study 
was to determine adult performances not only on intelligence tests, but also 
in perception, motility, memory, the understanding of spoken directions, 
and other fields. In the testing program of 1930, Miles planned as compre- 
hensive a test battery as could be administered in a two-hour period. He 
found that under proper conditions all subjects could be induced to give 

i* Op. at , 582. 17 Op at , 269 T * Miles, op at , 107 

7 ® Evidence that the percentages of adults belonging to clubs or other organizations differ 
markedly according to social and financial status has recently appeared in the report of the 
White House Conference on Child Health and Protection, Section III A In the questionnaire 
study of personality development in about 13,000 public school children of the eighth, ninth, 
and tenth grades, it was found that 53 P« cent of the poor mothers belonged » no organiza- 
tion — church, social, or educational This surprisingly high percentage fell ui the case of the 
low er-tniddle<l ass mothers to 31, in the case of the upper-middie-dass mothers to 12; and in 
the case of the wealthy only 7 per cent belonged to no organization. (The Adolescent in the 
Family. New York D Appleton-Century, 1934 Table 47, page 359 ) Corresponding data 
were apparently collected for lathers but were not gtven in the report 
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thli amount of time to the work; he noted, however, a danger of "waning 
interest" even within these two hours, and slated further that not all 3 duhs 
who serve once will sene twice and fewer still a third or fourth time.* 0 The 
returning groups must therefore have been increasingly selected if new 
subjects were not added. Many ness* subjects were taken on in the testing 
program of 1932, and the investigation as a whole is based upon a number 
of different groups of subjects, constituting altogether the most extensive 
survey of adult performance ever obtained, with the exception of the Army 
survey. The work h3$ so many ramifications that a complete estimate is not 
yet possible. 

The parts which are of greatest interest in connection with the present 
study of adults are the results, first, of the intelligence test given to the 
main group of 823 subjects, aged seven to ninety-four, and, second, of cer- 
tain of the particular studies, notably those of the simpler perceptual and 
motor abilities, the understanding of spoken language, and the adaptability 
to test situations. 

The intelligence test employed was very brief, a form of the Otis Self 
Administering Test of Intelligence, Higher Examination, Form A, given 
as a fifteen-minute test.* 1 In justification of such a short test Miles and 
Miles reported that it showed a correlation of -f .86 ±jk with the thirty- 
minute form in the limited range of superior ability represented by tar uni- 
versity students, and gave other similar data. It cannot be denied, however, 
that a fifteen, or even a thirty-minute test gives a scanty indication of the 
so-called general intelligence, and such tests of course show little as to the 
relationship between different types of performance or their course with 
age. 

In order to make some comparison with the Army results, the scores on 
the fifteen-minute Otis Test were transmuted into equivalent scores on a 
twenty-minute abbreviated form earlier used by Miles and Miles; these 
were then transmuted into equivalents for the original Otis Test, and these 
finally transmuted into the equivalent Army Alpha scores given by Otis. 
Whatever the effect of so many transmutations, the results showed clearly 
that the Army Alpha equivalents for the median Otis scores for Miles 
group were much higher than the Army median of 59 for the native-white 
draft- 82 In short, the judgment as to the level of intelligence of the sample 

*» Mites, Op eil , JOS. * l Mites and Miles, op. nl , 4^7* , 

**The difference is e'en, more marked if the comparison is limited to those of . es su 
jeers who were within the age range of the Army draft. 
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which would be indicated by the nature of the selection seems to be con- 
firmed by these test findings. 


TABLE 111 


SCORES BY DECADES FOR 6l6 ADULTS, CITY B (TABLE VI, MILES 
AND MILES, ABBREVIATED) 




MEAN SCORE ON 
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ABBREVIATED 


D 
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PERIOD 

N 

OTIS TEST 

SIGMA 
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ARMY ALPHA. 

i5-«9 

5* 

3 8 -5 

8.0 

— .11 

I26 

30-29 

80 

38.7 

7*9 

3.53 
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3°~39 ■ 

87 

35-1 

9 6 

■17 

70 S 

4°“49 

90 

34-2 

9.2 

3-27 

I08 

50-59 

119 

299 

97 

2.86 

88 

60-69 

IOJ 

26.x 

10.6 

2.05 

72 

70-79 

68 

23-1 

8.7 

3-54 

61 

80-89 

18 

14.7 

9-3 
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On the question of the growth and decline of intelligence, as indicated 
by test scores on this fifteen-minute test, Miles and Miles stated: 83 

From a maximum score at the age of 18 yr. the curve of intelligence main- 
tains almost a plateau through the “twenties,” begins then to fall, and persists in 
its downward course, at first slowly, then more rapidly until the end of the life 
span. 

An abbreviated form of one of their tables is reproduced in Table III to 
show the changes in test score from decade to decade, and the reliability of 
the differences. 84 

ss Op. at , 60 

“The findings are given lor City B only; the age range in City A was much shorter, from 
fifty to ninety years. 
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The mean scores for the age periods fifteen to nineteen and twenty to 
twenty-nine are almost identical; the decline then begins; it seems to be ar- 
rested in the forties but is marked in each successhe decade thereafter. The 
arrest in the forties is interesting in connection with the Get that Jones and 
Conrad found a slight rise in die age period forty to forty-five.* 3 No such 
rise appeared in Willoughby’s results. An interpretation of the finding is 
difficult for in neither case docs it seem to depend on fluctuations of sam- 
pling* 

In summarizing the results of the Stanford investigation in his presi- 
dential address at Ithaca in 1932, Miles reported as follows on the relation of 
age and intelligence test performance for a larger group than that discussed 
above: 88 

The agc-score curve from performance on a standard intelligence test of 3500 
adulu, (approximately 250 over 70 )ears of age) weighted to represent the level 
of the general population, shows a top mean of 15 to 16 year mental age at life- 
age 18, persistence at practically this same mental level through two decades, 
then gradual decline to old age, registered in the Pearson correlation coefficient 
—.50 for agc-score in the period of adulthood. Stated in another way, the test 
ability gain of the last three years in the period of mental age growth is gradu- 
ally lost in the next three-score years. 

Miles then stated that results for 400 subjects with the speed factor elimi- 
nated showed th3t speed did not slacken before mental power. In a later 
paper, however, C. C. Miles reported that comparisons of the fifteen-minute 
Otis speeded and unspeeded indicated that in early and middle maturity 
speed declined faster than power, whereas in late maturity the decline of 
power was relatively more apparent. 87 

Despite the brevity of the Otis Test and the comparariv cly small number 
of items, these were analyzed with a view to determining their course with 
age. On this question W. R. Miles reported as follows:* 3 


See also Mursell's similar finding, footnote 70, page 23. 

•« Miles, op. at., ufi. 

• l C. C- Miles, Influence of Speed and Age on Intelligence Secret of Auu.tt, J. 


Psychol., 1934, jo. 20S-210. . 

Another interesting finding is reported in this paper. Alter noting that in a'l populations e 
average individual loses at the rate of about six or seven months mental age per decade . ” 
stated that decreases were apparently similar if individuals were retested. This finding wr e 
190 individuals tested in 1930 and retested in 1932 is important even though the test was 


fcrieC lor there have teen very few retests of adults. 


89 Op. eft., 11 6-1 17. 
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. . - verbal associations, generalizations, interpretations of meaning, and recog- 
nition of relations show marked resistance to the influence of age. Speed, organi- 
zation and recall of unfamiliar material, and difficult logical procedures involv- 
ing a relatively wide immediate memory span show’ speedier decline. Perhaps I 
should call the decline registered in our curve not one in intelligence as such but 
rather a diminution in reaction speed and sum of energy available for new work 
types. This would mean that the decline correlates with physiological rather than 
psychological deterioration. 

It seems unlikely that the question of the relation of age and particular 
intelligence test performances can be pursued very far on the basis of the 
items in the brief Otis omnibus test. Willoughby’s investigation and that of 
Jones and Conrad have more to contribute on this subject. As already noted, 
however, the Stanford Study included many other tests besides the “general 
intelligence" test, and the results of these present important data on a num- 
ber of traits or types of performance, many of which do not enter into the 
usual form of intelligence test and have rarely been studied at the adult 
level. 

Tests of visual perception, reaction speed, motility, and complex motor 
skills all gave similar age curves, characterized by highest performances in 
the late teens or early adult years, followed by slow decline to about sixty 
and more rapid decline thereafter. 89 The curves for memory and learning 
ability dropped more rapidly than those for the perceptual and motor ac- 
tivities. The grasping of spoken directions showed a decline amounting to 
approximately a third of the standard deviation per decade, and a correla- 
tion with age (—.51) about that of the abbreviated Otis Test. 80 A series of 
non-language tests constructed or selected to indicate "adaptability” to new 
situations, and correlating from +.30 to -J— 5 1 with intelligence, yielded 
correlations with age ranging from — .03 to — .56 81 In Marshs report on 
these tests, he concluded that human adaptability showed only a slight 
tendency to decrease with age. The point is an extremely important one, far 
if adaptability to new situations fells off rapidly with age, this condition 
may explain the declines on many tests, for example, the code test used by 
Miles in studying memory. Finally, certain of the performances or traits 
studied showed little or no change with age. Aside from the verbalizations, 

»®B*Pnc ™ The Grasping of Spo\en Directions as a« Age Function in Adults, Psychol. 
Bull, *933, 30, 588-589. 

•1C.J Marsh, Human Adaptability as Related to Age, Psychol Bull , 1933, 30. 589 
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generalizations, and so forth as indicated by the Otis items, these included 
imagination , 03 persistence in work, and the four traits studied by the Bern- 
rcutcr test— neurotic and introvert tendencies, self-sufficiency, and domi- 
nance 03 

The demonstration that different test performances reach their maxi- 
mum development at different ages and show different courses with age 
through maturity is one of the most interesting discoveries of the recent 
work. It is a problem which challenges future work as well, because the 
facts are a necessary basis not only for an understanding of mental func- 
tioning in the adult years, but also for the solution of practical questions as 
to the type of work the older person can best handle and the difficulties he 
is likely to meet. In this connection, however, there arc two important find- 
ings which must not be overlooked. One is that even those performances 
which decline rapidly after maturity in comparison to other test perform- 
ances, actually decline slowly in comparison to the rate of development. As 
general observations Would of course suggest, the changes during the adult 
years, at least up to the age of sixty, are slight in relation to the changes 
during childhood. A second important finding, which Miles and other in- 
vestigators have stressed, is that individual variations are large. A gradually 
declining curve from the twenties or thereabouts is apparently the true 
condition, but it must be noted that many individuals in the later decades 
surpass the average scores for the twenties or thirties on a long series of per- 
formances. 

A satisfactory attack on the problems involved calls for the study of dif- 
ferent types of activity, each of which must be tested separately and at 
sufficient length to afford a reliable estimate. Few investigations have as yet 
met these conditions adequately, but they are basic for much of the im- 
portant work which remains to be done, including the study of the rela- 
tionship of mental abilities at the adult level. There are still a number of 
activities which are probably significant in the adult period but have not yet 
been investigated; and it is evident that the satisfactory determination of 
tests for adults, the nature of which will differ according to the purpose of 
the work, awaits the study of these activities. 


Studied by the kinephantom, a rerolving £0 the silhouette of which the subject had to 
interpret; see W. R. Miles, Movement Interpretation! of the Silhouette of a Revolving Fan, 
Amer. J. Psychol , roti. 41 . aaa-jo?. 

M C. C. Miles, Age and Certain Personality Traits of Adults, Psychol. Bull , 1933. 3®. 57°- 
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I. SELECTION OF THE GROUP 
a. Problems involved 

I N selecting normal adults to serve as a control group in the study of 
aphasia, the first requirement was to obtain subjects similar in age and in 
educational, occupational, and cultural status. Survey of the possible sources 
for such a group showed that the most nearly similar sample of the popula- 
tion, and therefore the best control, was to be found in the patients admitted 
to the same hospitals in which the a phasic patients were studied but not suf- 
fering from any nervous disorder or any condition known to affect mental 
functioning. Permission was therefore secured to study patients in the wards 
of the hospitals in which the majority of the aphasic patients were being ex- 
amined: the Orthopedic Hospital and Infirmary for Nervous Diseases, the 
Gta&vnfct Hnspbil, iwd tire. PVukadR.lpb.va. Gesreril Hospital. Simples bom 
the Wards of these hospitals were satisfactory as controls because almost all 
'he aphasic patients were also ward cases. 

The advantage of this method of selection is immediately obvious. The 
patients admitted to one department of a hospital are more likely to be 
similar to those admitted to another department in all factors except the 
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type of disease which required their admission, than is any other limited 
group of the population which could be studied . 1 The disadvantage, when 
the purpose is the study of mental functioning in normal adults, is also ob- 
vious, for a number of hospital patients certainly cannot be considered 
normal either mentally or physically. At the same time there are many pa- 
tients whose hospitalization is enforced by the need for some particular 
treatment rather than by a serious protracted illness or general debilitation, 
and who, in the opinion of the authors, would be accepted by both medical 
men and psychologists as normal from the point of view of mental func- 
tioning. The problem in the selection of the normal control groups there- 
fore resolved itself into the choice of those patients whose disorders were 
least likely to have any effect on the functions required in the various men- 
tal and educational achievement tests. 

Neurological conditions were immediately excluded from the normal 
control group and in addition it seemed wise to reject all cases of glandular 
disorder, tuberculosis, or syphilis . 2 Aside from these, few types of disease 
had been studied with regard to possible mental changes and there was lit- 
tle material available to determine the choice of patients . 8 Data from tests of 
intelligence in cases of children with diseased adenoids and tonsils were 
not of immediate value because patients coming in for treatment for these 
conditions had not been hospitalized long enough to be studied thoroughly. 
Many of the subjects finally accepted had had tonsillectomies at some previ- 
ous time; very few showed diseased tonsils or adenoids at the time of the 
examination. 

In view of the fact that orthopedic cases were available for study and 
were particularly suitable because they were in good working condition 


1 For a discussion of the possible slight differences in economic /actors, see Weiscnburg and 
McBride, Aphasia. A Clinical and Ptychologiccd Study, laj. 

* It w known that certain glandular conditions are accompanied b y definite mental dunges, 
and various investigations, particularly in the Cerman literature, have indicated that ere « 
characteristic mental disturbances and defects in speech and writing jo tuberculosis, or a 
cent paper, sec J. Kollants, Ober Sprach- und Schretbstorungen im altgemetnen una as 
"kjeine Zeichcn der Gctstcssch wdeh ung" bet Tuber\uloie und im Alter, Arch, t Psyc “ • 


Neereath , rgjj, gg, roj-tgtf. . . IT 

Syphilis is apparently characterized by slightly inferior intelligence test performance, 
condition, as explained in the Army Memoirs, may as well indicate that those or lower in 
gcncc are more likely to expose themselves to infection or to neglect prophylactic measures 
that syphilis results in lower test performance. (Mem. Nat Acad Sa , 1921, 15, lU) J 
case, the matter is controversial, and syphilitics could not safely be accepted for the so 
normal group. 

* For a recent discussion of the problem, sec C. C. Schweninger's excellent summary 
Heredity and Environment, 307—325. 
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and had unlimited time, the study of crippled children made by Fernald 
and Arlitt was of special interest/ These investigators found an average 
Stanford Binet I.Q. of 82.35 for a group of 194 children, a figure which 
would seem to indicate somewhat inferior intelligence. Analysis showed 
that this mean I.Q. was lowered by the inclusion within the group of 
27 cases of spastic birth paralysis (mean I.Q. 69.11) and 15 cases of cen- 
tral nervous system involvement, including congenital lues (mean I.Q. 
75.93) . The groups of poliomyelitis, tuberculosis, nutritional disorders in- 
cluding rickets, infections not involving the central nervous system, and 
trauma ranged from 83.79 t0 &6-53 in f-Q- These groups were still slightly 
inferior, hut apparently not because of the disease conditions, for similar 
ranges in I.Q. were characteristic of the siblings of the crippled children. 
In short, it seemed probable that the crippled children, who were all insti- 
tutional cases, were drawn from inferior family groups in which I.Q.'s of 
about 85 would be typical. 

Another study of interest in view of the large number of gastrointestinal 
cases available for examination was Paulsen’s report on mental and motor 
efficiency in cases of intestinal toxemia. 5 The group under treatment for in- 
testinal toxemia gained 16 per cent more than the control group on reexami- 
nation by a battery of mental tests and tests of steadiness, motor control and 
coordination. The author concluded that this net gain represented the in- 
crease in mental and motor efficiency resulting from the treatment. This 
conclusion might well be questioned, however, in view of certain unsatis- 
factory conditions in the experiment. The test group was selected on the 
basis of the presence of harmful bacteria in sufficient numbers in the feces, 
and the majority when questioned "confessed” the symptoms of intestinal 
toxemia. The control group was matched as nearly as possible in chrono- 
logical age, test intelligence, training, and environment; but no attempt was 
made to determine the presence or absence of intestinal toxemia in these 
cases. A further, though probably less important, defect in the experimental 
conditions is a difference of motive in the two groups: the test cases were 
promised assistance in bettering their intestinal conditions in return for 
work on the tests while the controls “entered the experiment as a matter of 
interest and desire to be of assistance to the examiner.” 

4 M. K. Fernald and A. H. Arlitt, A Psychological Study of a Croup of Crippled Children 
of Various Types, Sch. St Soc , 19*5. iJ > 449~45* 

* A. E. Paulsen, The Influence of Treatment for Intestinal Toxemia on Mental and Motor 
Efficiency, Arch. Psychol., 19*4. No. 69. 



3 s selection and composition of the group 

Ai these report* indicate, there wat little evidence at the beginning of this 
research a* to the effect on intelligence of disease, other than glandular 
disease or disorder* of the central nervous system. During the course of the 
work, however, there appeared a very important study by Dawson and 
Conn, which i* more extensive and more decisive than any previous investi- 
gation * An intelligence test, the Burt Revision of the Binet, was given to 
over a thousand children suffering from types of disease which included 
pneumonia, rheumatism, nephritis, glandular disorders, epilepsy, encepha- 
litis lethargica, and other brain conditions. The findings show clearly that 
non-brain cases made scores on intelligence tests equivalent to those made 
by the healthy population to which the)’ belonged, while the brain cases 
were on the average inferior. In other words, disease, apart from disease of 
the brain or ductless glands, “does not appear to have any appreciable efTert 
on the intelligence.'’ An interesting analysis of mental test performance in 
acute, non-acute, and chronic conditions revealed the fact that the average 
scores were actually higher for patients examined when fevered and acutely 
ill; they were slightly lower for patients classified as chronic than for those 
in the afebrile but non<hronic group, but the difference was not significant. 

The subjects selected for this research were studied during periods of less 
serious illness than those whom Dawson and Conn examined. With so lit- 
tle positive data on the problem, however, it was considered safest to restrict 
the cases to patients with bone fractures or dislocations and to patients re- 
cuperating from some surgical treatment and in good working condition at 
the time of the examination. 

b. Criteria for selection 

For the reasons discussed above, the normal adults were chosen from the 
orthopedic and surgical wards, with further limitations which would make 
them satisfactory as controls for the aphasic group. These limitations were 
as follows: 1 

1. Freedom from any present or earlier neurological, mental, or glandular 
disease 

2. Satisfactory vision and hearing 

3. Age under sixty years 

4 . English as the native tongue. 

* S- Dawson and J. C. M Conn. Intelligence and Disease, Med. Research Council, Spec. 
Report Series, No. 162, 1931. 

T Work was begun at the Orthopedic Hospital where a number of patients were examined 
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c. Method of approach 

When cases which seemed to fill the requirements had been selected from 
the hospital records, or the physician’s report, the examiner proceeded very 
informally to explain the problem and ask for the patient’s cooperation. He 
was told that studies were being made of patients who had speech difficul- 
ties resulting from a “stroke” or injur)’ to the brain, and that these studies 
were handicapped by lack of knowledge as to just how much the average 
man remembered of his schooling, or how much he had learned without 
schooling. It was further explained that we were as yet unable to do a great 
deal for the 3phasic patient because we did not know how the ordinary 
man worked on tests of reading, writing, and arithmetic. Two points were 
always made as clear as possible: i) that the “ordinary man” was the one we 
wanted to study, not the well-educated man; and 2) that while each person 
must do his best, it was not his individual score, but the general results for 
the group which interested us. Sometimes one of these points was more use- 
ful in clinching the bargain, sometimes the other; and occasionally our ob- 
jectives were attained when our explanations did not seem to be compre- 
hended at all. 

All investigators report a certain number of refusals; Miles and Miles 
speak of the “difficulty which we and others have encountered with adult 
subjects in administering intelligence tests." In this work, of the total num- 
ber of patients asked to take the examinations, only 21 per cent refused. 8 

The figures for the three hospitals are: Philadelphia General, 24 per cent; 
Graduate, 19 per cent; and Orthopedic, 18 per cent. 

The motives which led to acceptance in the remaining yg per cent of the 
cases are of some interest. Three types were distinct, but all of these were 
probably present to some degree in most cases. The first was a willingness 

who did not meet the requirements Co r selection which to on had to be laid down. These pa- 
tients were eliminated from the group, but when the final calculations had been made and the 
complete hospital records were again checked, it was found that the group still included 9 pa- 
tterns who did not meet the rigid requirements: 2 of these patients were being treated for 
neunus at the time of the examination, 3 were cases of deformity resulting from poliomyelitis, 
and 4 were cases of tuberculosis of the bone or joint The mean score on the Stanford JSinct 
Scale for these 9 patients was 161 and the mean score on the Pintner Non-Language Mental 
Test 346. Since the means for the total group on these two tests were 164 and 356 respectively, 
and since the group of 9 fell close to the average of the total group on die other tests as well, 
it seemed clear that the inelusion or exclusion of their records would not affect the results 
materially. 

•The patients gave various reasons for refusing Some thought they were too sick; some 
had families who would not approve, and some simply “did not want to be bothered-" 
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to do anything which would help someone else. The second was boredom; 
the patient welcomed any occupation to fill in time. The third was more 
personal, arising from a reluctance to say “no* to the examiner and pride 
at being selected. At the Philadelphia General and Orthopedic Hospitals, 
where work was carried on over long periods of time, there was some good- 
natured banter about "taking letsoni but this publicity was an advantage 
rather than a disadvantage; patients felt that there was a certain amount of 
prestige attached to taking the examination, and a number who had not 
been selected offered themselves as subjects. 

It will have been evident that in approaching the patient, the emphasis 
was placed rather upon achievement in reading, writing, and arithmetic 
than upon intelligence. During the course of the examination, a few of the 
patients asked if some of the tests were intelligence tests, but were not dis- 
turbed upon being told that they were. 

Whether the tests were designed for group or individual application they 
were alt given as individual tests, a very important point since individual 
work enabled the examiner to become well acquainted with each patient, to 
observe his methods of work, and to ensure his steady cooperation. Occa- 
sionally a patient was reluctant to try some particular test; if he objected 
seriously the test was omitted, hut in most cases he could be persuaded to 
attempt the work, and once persuaded, he seemed to take pride in doing 
his best. Cooperation was splendid, and the contact between patient and 
examiner all that could be desired. The advantages which' accrue from a 
favorable personal relation cannot be overemphasized. 


2. COMPOSITION OF THE. GROUP 0 


a. Distribution by diseases 

The distribution of cases according to the pathological condition necessi- 
tating admission to the hospital is presented in the following outline: 
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Due to infection: carbunculus 

Due to trauma or physical agents: third degree burns of legs . • 
ulcer of leg 


• All cases at the Orthopedic Hospital and two at the Graduate Hospital were eaamined by 
Mrs. Gardiner, the remaining eases by Dr. Roe. _ _ . .g j 

l *U* terminology and form of the classification are those gitto in a Standard Clamp 
Nomenclature of Disease, H B. Logie, ed . »9JJ. 
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DISEASES OF THE MUSCDLO-SKEtETAt. SYSTEM 

Of the bones 

Due to infection: chronic osteomyelitis of right or left femur . . 2 

chronic periostitis of right femur . 

tuberculosis of vertebra 

Due to trauma: fracture of vertebra 

fracture of right or left femur 5 

fracture of right or left tibia and fibia 

fracture of coccyx 

Due to unknown causes: osteitis deformans of left fcccuic . . . 1 


Of the joints 

Due to prenatal influences: congenital dislocation of hips . . . 

Due to infection: ankylosis of right hip, tuberculous 

ankylosis of left knee, tuberculous 

arthritis of hip, non-tuberculous • . 

arthritis of knee 

chronic arthritis 

arthritis of sacro-iliac joint 

Due to trauma: sacro-iliac subluxation 

Due to unknown causes: osteoarthritis 

osteoarthritis of knees 


Of the muscles 

Due to disturbances of innervation: paralysis of muscle, post- 
poliomyelitic 3 


DISEASES OF THE CARDIOVASCULAR SYSTEM 

Of the veins 

Due to trauma: varicose veins of the leg 1 


DISEASES OF THE DIGESTIVE SYSTEM 

Of the small intestine 

New growths: adenocarcinoma of intestine 11 x 

Due to unknown causes: duodenal ulcer 2 

Of the colon 

Due to disturbances of innervation: spasticity of colon . . . . 1 

Of the appendix 

Due to infection: acute appendicitis 3 

U At the time of examination, a year before this diagnosis was made, the diagnosis 
"chronic intestinal obstruction.' ‘ 
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Of the amts 

Due to static mechanical abnormality: hemorrhoids t 

Of the tile passages 

Due to infection: acute cholecystitis x 

Due to static mechanical abnormality: cholelithiasis 4 

Of the abdomen and peritoneum 

Due to static mechanical abnormality: /emoral hernia . ... 1 

inguinal hernia 7 


DISEASES OF THE KEXVOUS SYSTEM 3 

Due to unknown causes or causes not determinable: 

trigeminal neuralgia i 

left median and ulnar neuritis i 

UNCLASSIFIABLE, OBSERVATION 6 


b. Distribution by race 

The group reported in this monograph is composed entirely of white 
subjects. 1 * All of them had been born in an English-speaking country and 
65 of the 70 were natives of the United States. Of the remaining cases four, 
2 men and 2 women, had been born in Ireland and one, a woman, in Scot- 
land. Of the 65 subjects, only 2 were Jewish and only 2 came from homes 
where a language other than English was spoken regularly, the language in 
each case being Italian. 

** A Negro group studied by the same methods numbered only 15, and it has not been 
considered worth while to report the results for such a small group in detail. In view of the 
fact that there are few studies of adult Negroes, however, several important findings are 
worth summarizing. It must be noted that the hospital seems to afford a good solution to 
the problem or securing comparable groups of whites and NegToes in the adult period, for the 
patients admitted to the wards of most hospitals, whether they are white or Negro, arc likely 
to be derived from closely similar levels of the population. The probability that the groups 
will be comparable is particularly high when the studies are made in a dty hospital, like the 
Philadelphia General, or in a hospital like the Graduate which serves a district containing a 
large proportion of Negroes of various economic strata. 

The mean age of the Negro subjects, 1 1 male and 4 female, was 40 years. The mean school 
grade completed was €, and the mean occupational rating on the Barr Scale 7.13. These two 
figures may of course reflect inequality of opportunity between the Negro and the white, but 
in any case the Negroes were inferior on all the chief tests except the Porteus Maze, and sig- 
nificantly inferior on sonic, notably the Pintner Non-Language Test and the Stanford Binet 
Vocabulary. The superiority of the Negroes on the Porteus Maze was not statistically signifi- 
cant, but it is an interesting difference, which has not, to the knowledge of the authors, been 
reported by other investigator*. 

The Negro subjects cooperated even more freely than the white subjects. They were imme- 
diately and obviously interested in the tests and they were apparently less self-conscious than 
the white subjects. 
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TABLE IV 

DISTRIBUTIONS BY ACE FOR THE THREE HOSPITAL CROUPS AND FOR 
THE TOTAL TEST CROUP 


PHILADELPHIA 

GENERAL GRADUATE ORTHOPEDIC 

HOSPITAL HOSPITAL HOSPITAL ALL THREE HOSPITALS 


ACE 

St 

F 

U 

F 

St 

F 

Stale 

Female 

Total 

5°~59 

3 

4 

, 

X 

X 

3 

5 

8 

*3 

40-49 

3 

1 

3 

X 

5 

1 

11 

3 

14 

30-39 



2 

I 

7 

3 

*7 

4 

21 

20-29 

a 


3 

a 

3 

5 


7 

>5 

10-19 

1 

* 

1 


4 



1 

7 

Number 

17 

6 

10 

5 

10 

12 

47 

2 3 

70 

Mean 

389 

47-2 

34-3 

36.1 

3*-3 

34.1 

35-tit ■« 

38.o±2.8 

36.1*1.5 

S «”" 

1 1.1 

>3 4 

>3 0 

11.2 

1 1.7 

128 

12 2*1-4 

138*20 

12.8*1.1 


TABLE V 

DISTRIBUTIONS BY SCHOOL CRADE REACHED FOR THE THREE HOSPITAL 
CROUPS AND FOR THE TOTAL TEST CROUP 


SCHOOL 

GRADE 

REACHED 


PHILADELPHIA 

CENERAL GRADUATE 

HOSPITAL HOSPITAL 

U F St F 


ORTHOPEDIC 
HOSPITAL 
U F 


ALL THREE HOSPITALS 
Stale Female Total 


15-16 

13-14 

11-12 

9-10 

7-8 

5-6 

3*4 


B 

4 

16 


6 

26 

15 


Unknown 


Number 


*7 

8 0L4 


27+3 


*3 


8i£6 

3.3-5 


90 


8**3 

3.o±.a 
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C Distribution by age 

Tabic IV shows the age distributions by decades for the male and female 
sub-groups in cadi hospital and for the total group. The mean age for the 
patients at the Philadelphia General Hospital, 41.1 years, was higher than 
that for the total group, while the mean ages for the patients at the Graduate 
and Orthopedic Hospitals, 3^.9 and 33.1 years respectively, were lower. The 
differences, however, were not great. 

d. Distribution by education 

The data on educational level are expressed in terms of the grade the sub- 
ject had reached before leaving school. 15 

Distributions for the separate hospital groups and for the total group arc 
presented in Table V. 1 * There were some differences between the mean 
levels for the groups in the three hospitals: the Philadelphia General fell 
lowest with a mean school grade of yjo, the Graduate next with a mean of 
7-4. and the Orthopedic considerably higher with a mean of 9-0. 

c. Distribution by occupation 

Two occupational scales were employed, the Taussig and the Barr.” The 
Taussig is a discrete five-division scale based on general type of occupation: 

I Professional, II Semi-professional and business. III Shilled labor, IV 
Semi-skilled labor, and V Unskilled labor. The Barr Scale was constructed 
on the basis of a list of a hundred representative occupations, each of which 
was rated by thirty judges according to the grade of intelligence believed 
to be necessary for that particular occupation. 18 Thus it gives an index of 

11 Adjustment was made for the older patients who went to school at a time when twelve 
terms were roughly equivalent to eight grades today, and for the English, Irish, or Scotch-born, 
whose school terms differed in length- When, as in many cases, the patient stated he had fin- 
ished "two or three" or "four or fire" grades, the lower number was always recorded. 

It would have been interesting to record also the number of grades repeated, but this addi- 
tional 6et was not obtained in all cases and indeed it is likely to have been forgotten in many. 

14 la 5 of the 4 cases at the Philadelphia General Hospital in which no statement as to 
education could be elicited, the level was probably lour years or below. The fourth case, how- 
ever, was one of the most intelligent men of the group, and it is probable that he had had at 
least an eighth -grade education. _ , 

The 2 women at the Graduate Hospital whose records lor education are incomplete proo- 
ahly had had the equivalent dF fire years of schooling. _ _ 

18 Toman, Genetic Studiet of Genius. I, Stanford University: Stanford University Press, 

*955. 63-61: 66-69 _ . 

19 Ratings for occupations other than those listed on the scale may he derived by matching 
the occupation to be rated with that which seems most similar to it, or by calculating an inter- 
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TABLE VI 


DISTRIBUTIONS BY TAUSSIC OCCUPATIONAL CLASSIFICATION FOR THE 
THREE HOSPITAL CROUPS AND FOR THE TOTAL TEST CROUP 


TAUSSIG 

CLASSI- 

FICATION 

PHILADELPHIA 

GENERAL 

HOSPITAL 

11/ F 

CRADUATE 

HOSPITAL 

M F 

ORTHOPEDIC 

HOSPITAL 

M F 

ALL THREE HOSPITALS 

Male Female Total 

I 

It 

4 


3 

4 

11 

1 

4 

15 

m 

6 

5 

2 

2 

13 

2 

IS 

IV 

6 

3 1 

4 

4 

n 

3 

18 

V 

1 


3 


4 


4 



TABLE VII 





DISTRIBUTIONS BY BARR OCCUPATIONAL RATINGS FOR THE THREE 


HOSPITAL GROUPS AND FOR THE TOTAL TEST GROUP 



PHILADELPHIA 








GENERAL 

GRADUATE 

ORTHOPEDIC 





HOSPITAL 

HOSPITAL 

HOSPITAL 

ALL THREE HOSPITALS 

RATING 

U F 

M F 

M 

F 

Male 

Female 

Total 

13-00-13 99 




1 


1 

1 

12 00-12 99 



* 

* 

1 

1 

* 

li.oo-u.99 

2 

t 

1 

3 

3 

2 

5 

10.00-1099 

2 

1 

4 

* 


* 

9 

9 00- 9.99 

3 

a 

* 

* 


I 

7 

8 00- 8 99 

3 

1 4 


* 

t 

10 

11 

7.00- 7-99 

5 

3 

3 

1 

9 

* 

to 

6 00- 6.99 

1 

1 1 

1 

3 

4 

4 


5 00- 5 99 

3 

* 

3 




6 

4 00- 4 99 





* 


* 

3 00- 3- 99 

1 


X 


* 


2 

Number 


10 5 

14 

12 

41 

at 

62 

Mean 

7.6 8.4 

86 79 

8.2 

9-4 

8 o±.4 

8 8±-s 

8-3-3 

Sign. 

2.1 0.0 

18 09 

2 6 

2.4 

2 3--3 

2.J±.3 



the intellectual level characteristic of the occupation, while the Taussig 
Scale simply classes together occupations similar in type of work involved. 
Distributions on the Taussig Scale are given in Table VI. IT Inspection of 
iT At the Philadelphia General Hospital 6 women, 5 housewives and one student, could not 





44 


SELECTION AND COMPOSITION OF THE GROUP 


the table will show that the Orthopedic Hospital group contained the larg- 
est proportion of cases from the professional and semi-professional classes. 
The Philadelphia General and Graduate groups contained relatively larger 
proportions of cases in classes III and IV, skilled and semi-skilled labor. As 
a whole the distribution on the Taussig Scale shows that the subjects are 
fairly equally divided among the semi-professional and business, the skilled 
labor, and the semi-skilled labor classes. 

Ratings on the Barr Scale are presented in Table VII. 13 The three occu- 
pational distributions, like the educational, overlap considerably, with the 
mean for the Graduate Hospital (8.36) higher than that for the Philadel- 
phia General (7.73), and the mean for the Orthopedic (8.75) higher than 
that for the Graduate. This overbpping is just what would be expected 
from a knowledge of the populations served by the three hospitals. 19 


3. COMPARATIVE FINDINGS IN FOUR DIAGNOSTIC CROUPS 
The majority of the cases, 45 of the 70, fall into three main groups ac- 
cording to their medical diagnosis: injuries of the bone and dislocations, 
infections of the bones or joints, and diseases of the digestive system. The 
bst group may be divided into two if the cases of hernia or hemorrhoids 
are separated from those of duodenal ulcer, intestinal obstruction, appendi- 
citis, cholecystitis, and cholelithiasis. Before the group of 70 cases is consid- 
ered as a whole, a preliminary comparison of the status of these four diag- 
nostic groups is worth while. 

Table VIII shows the mean levels for the four groups in age, education, 
occupational rating on the Barr Scale, and scores on the Stanford Binet and 
Pintner Non-Language Tests, together with the sigma deviations from the 

be classified on the Taussig Scale. At the Graduate Hospital 4 women could not be classified, 
all were housewives. At the Orthopedic Hospital one man and one woman, each crippled ana 
unable to work, and 5 young men. all students, could not be classified. • 

38 All cases were rated with the exception of one 5 object at the Philadelphia General and 7 
subjects at the Orthopedic Hospital. These were either students or unemployed cripple*. 

After some Consideration housewives were assigned a rating of 8.37. This was obtained by 
averaging tie Farr ratings for the coot and the tailor, a process which seemed w hr roeSssr-w 
by the nature of the occupation. In some cases the raring is too low and in others obviously 
too high. It would probably have been fairer to give each housewife the rating which would 
have been assigned her husband, but this solution did not suggest itself until the opportunity 
for obtaining the necessary information had passed. , 

*• Very few of the ward patients at the Philadelphia General Hospital pay any fees, and 
so fir as is known, none of those included has. At the Graduate Hospital, 4S P° ctt3t 
ward patients paid an average fee of $1-00 per day. At the Orthopedic Hospital, 51 per tent 0 
the ward patients paid an average of St J7 per day. 
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means for the total group in education, occupational rating, and the two 
mental tests. 20 It will be evident that the groups of patients with arthritis 
and bone fractures and dislocations make higher mean scores on the Stan- 
ford Binct and Pintner Non-Language Tests than either of the groups of 


TABLE VIII 

COMPARATIVE FINDINGS FOR FORTY-FIVE CASES DISTRIBUTED BY 
MEDICAL DIAGNOSIS 



FRACTURES 

AND DISLO- 
CATIONS 

ARTHRITIS 

CASTRO- 

INTESTINAL 

OTHER THAN 

HERNIA 

AND HEMOR- 
RHOIDS 

HERNIA 

AND 

HEMOR- 

RHOIDS 

Number of cases 

II 

13 

12 

9 

MEAN LEVELS 

Age 

31-2 

36.X 

37-2 

39-6 

School grade reached . . . 

7-9 

10.2 

7.* 

7-4 

Occupation: Barr Scale . . 

7-9 

9.6 

8.5 

8.6 

Stanford Binct Scale . - - 

168.1 

■78.5 

152.5 

155.0 

Pintner Non-Language Test . 353.3 

SICMA DEVIATIONS FROM THE MEANS 

377-0 

347-i 

353° 

FOR THE TOTAL CROUP 

School grade reached . . . 

—.1 

+■ 7 

— *3 

— .2 

Occupation: Barr Scale 

—.2 

+.6 

+.1 

+•» 

Stanford Binet Scale . . 

+*2 

+.6 

—•4 

-•3 

Pintner Non-Language Test . 

0.0 

+ •2 

—.1 

0.0 

PERCENTAGE DISTRIBUTION 

OF THE CROUP BY HOSPITAL 

Philadelphia General . 

36 

8 

42 

78 

Graduate 

*9 

8 

5° 

■xx 

Orthopedic 

45 

84 

8 

0 


patients with gastro-intestinal disorders. The first two groups surpass the 
mean for the total number of patients on the Stanford Binet Scale while 

*« The Stanford Binet and the Pintner Non-Language represent only two of the large num- 
ber of ttsti used in the ttudy, but art tuRoent for thU brief tomparitue tuteey of the mental 
level* of four diagnostic group*. 



' \6 SELECTION AND COMPOSITION OF THE GROUP 
the second two groups fall below it. The differences are similar to the dif- 
ferences in education, and the latter depend on the differing proportions 
from the three hospitals in the various diagnostic groups. Since the Ortho- 
pedic Hospital is superior to the other two in the educational level of the 
patients studied, the arthritic group, with n of 13 patients from the Ortho- 
pedic, reaches a higher mean level in education than the others. Mean 
scores on the Pintner Non-Language Mental Test, which are probably less 
affected than the Binet scores by educational and cultural status, are similar 
in all four groups. 

The slight differences between the mental levels in the four diagnostic 
groups apparently reflect only differences in selection; it seems clear that 
they do not depend on the positive or negative effects of any one or more 
disease conditions. This is an important point, for there may be less con- 
troversy as to the normality of mental functioning in one diagnostic group 
than in another. In cases of bone injury or of dislocation, when the patient 
has been in good health up to the time of the accident which caused his 
injury, when he is without pain and the treatment is proceeding well, there 
can be no doubt that he reacts normally to the tests given him. In the 
gastro-intestinal cases, in some instances after an abdominal operation, 
there may be some question as to whether the patient reacts normally. It is 
important therefore to note that the patients of this character reached the 
levels expected of them on the basis of their educational status just as those 
of any other group did. In view of the fact that each of the four groups is 
small, the figures are indeed extremely consistent. It goes without saying 
that in the case of all patients every precaution was taken to ensure that the 
findings would be as trustworthy as possible, and examinations were made 
only when the patient was in good working condition and when he felt 
ready for work. 

COMMENT ON THE SAMPLE 

Two criteria were available for placing this small sample of adult sub- 
jects in relation to larger populations: first, the occupational distribution of 
the 1930 census for the city of Philadelphia and, second, the educational 
and occupational data for the very large groups of adult men examined 
during the War. The first criterion, of course, provides only an index of the 
similarity between the occupational distributions for the test group and the 
city population; it does not indicate, as the second does, anything as to the 
educational or intellectual level of the test group. 
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The 1930 census of the United States, which was made during the first of 
the three years in which these subjects were examined, shows occupational 
distributions for the city of Philadelphia which are expressed in Table IX 
in terms of the percentages of the native-white male, native-white female, 
and total native-white population over ten years of age and gainfully em- 


TABLE IX 

PERCENTAGE DISTRIBUTION BY OCCUPATION OF THE NATIVE-WHITE 
POPULATION OF PHILADELPHIA AND OF THE EMPLOYED 
MEMBERS OF THE TEST GROUP 


MALE FEMALE TOTAL 

Philadelphia Test Philadelphia Ten Philadelphia Tert 

population group population group population group 


Agriculture .... 

- 

0 

- 

0 

_ 

0 

Forestry and fishing 

- 

0 

- 

0 

- 

0 

Extraction of minerals . 

- 

3 

- 

0 

- 

3 

Manufacturing and me- 
chanical industries 

42 

39 

29 

18 

37 

35 

Transportation and com- 
munication . . . 

11 

25 

3 

9 

to 

2t 

Trade 

30 

23 

*1 

9 

*5 

>9 

Public service (not else- 
where classified) . . 

5 

O 

_ 

0 

3 

0 

Professional service . - 

6 

2 

>3 

18 

7 

5 

Domestic and personal 
service 

3 

8 

12 

38 

12 

12 

Clerical occupations 

*3 

3 

3 « 

18 

>5 

6 


Note: Dash (-) = less than one per cent. 


ployed in the ten occupational divisions. 21 The corresponding percentages 
in each of the occupational divisions for the test group are placed beside 
those for the city population. Inspection of the table shows a marked simi- 

** TtPemth Ctntvt of the United State/, /93a. IV, Oervpstiont by States, Waihiflgton. 
U-S. Cov’t Priming Office, 19JJ. Table ta, 1412-1415. 

The percentage* falling in each of the ten occupational division* were a*cu!attd separately 
for the native-white male*, the naure-whice female*, and the total native-white population. 
Data for the foreign born were ducardcd became there were only fire foreign bora in the trrt 
group, *nd thne were all native* of Engluh-*praYiflg eountriet and probably better clued 
with the native-white than with the foreign -born pspu'ation. 
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Iarity between the occupational distributions for the test group 3nd for the 
city population. For both the total test group and the white males the per- 
centages in the divisions of manufacturing and mechanical industries and 
of trade arc very nearly the same as the percentages for the city population 
in these two divisions. The only discrepancies which are at all noteworthy 
are in the division of transportation and communication where the per- 
centages for the test group exceed those for the city population and in the 
clerical occupations where the opposite relationship appears. On the whole 
it is clear that the test group, small as it is, represents a good sample of the 
city population From the point of view of the occupational divisions listed 
in the census report. Whether the test group samples the upper, middle, or 
lower strata of these occupational divisions cannot, of course, be deter- 
mined from these figures, and for evidence on this point one must turn to 
the analyses which have been made of the occupations and Army Alpha 
scores of the recruits examined during the War. 

A convenient method of estimating a mean Barr Scale rating for the 
occupations of the men tested by the Army Alpha during the War is pos- 
sible on the basis of Fryer’s and Symonds’ work with the Army dat3. 
Fryer reported the corrected Army Alpha scores for 96 occupations. 82 From 
these Symonds selected the occupations representative of a score of about 
48 on the Army Alpha, that is, a score which appeared to be typical of the 
average man. 23 These occupations, except for a few which were not clearly 
designated, have each been assigned a Barr rating. They are as follows: 


Masons 

Hospital attendants 
Station agents 
Miners 
Teamsters . 

Riggcu . . . 

Boilermakers . . 

Airplane workers 
Factory storekeepers 
Horse shoers . 
Salesclerks . . 
Hostlers (RJt.) . 
Barbers 

Stationary engineers 
Cobblers ... 


8.58 

6.26 (estimated) 
10.26 

4- 29 

5 - 4 * 

8 00 (estimated) 
9.06 (estimated) 
8.65 (estimated) 
ro.61 (estimated) 

■ 5-97 (estimated) 
9.72 

691 (estimated) 
6.92 

951 (estimated) 
6.85 


: Fryer, op. ett , 275-276. 


Symonds, op. ett., 65-78. 
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Horse trainers . - 
Caterers .... 
Bricklayers . 

Auto truck chauffeurs 
Farmers .... 
Bakers .... 
Mean . , . 


7.24 (estimated) 
9.97 (estimated) 
7-77 

7.23 (estimated) 


7-24 

7.91 

8.12 


The range of these occupations on the Barr Scale is surprisingly wide, 
4.29 to 10.61. The mean rating is 8.12. This figure is very close to the mean 
Barr rating of 8.01 which was found for the men of the test group. 21 The 
fact that the men of the test group fall so near the mean rating for the occu- 


TABLE X 

PERCENTAGE DISTRIBUTION 8V SCHOOL CRADE REACHED OF MALES 
IN THE TEST CROUP AND OF THE NATIVE-WHITE DRAFT OF 
THE AMERICAN ARMY 


SCHOOL CHADS 

REACHED 

TEST CROUP 

ARMY 

15-16 

0.0 

2.0 

, 3“ 1 4 

2-3 

3-3 

II— 12 

18.5 

68 

9-IO 

9-3 

It. I 

7 -a 

37-3 

39-8 

5 -« 

25.6 

20.1 

3~4 

7.0 

1 1.2 

1-2 

0.0 

3-8 

0 

0 0 

1.9 

Number of subjects 

43 

60,250 

Mean grade reached 

8.0 

6.9 


pations which have been thought to characterize the man of average intelli- 
gence as indicated by Army Alpha score, together with the similarity in 
occupational distribution shown between the test group and the city popu- 

** The mailing of moil of the women *i houiewitr* at a Barr rating of 8.J7 it the chief 
(actor in the nie of the mean for the total group over that for the men only. At noted be lore, 
that ii *n uniatiiiaciory rating for the bouiewirn, It w probable that the tnno for the men i» 
more truly rtpreientatire of the Ie*<l of the group. 
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lation, is evidence that the test group is a fairly good sample with regard to 
level of occupation, that is, grade of intelligence required, as well as type of 
occupation. 

The education of the adults in the test group may be compared directly 
with the education of the native-white draft of the Army.” The percen- 
tages of each group reaching the various school grades, which are presented 
in Table X, arc very similar. 20 The mean for the test group is somewhat 
higher than that for the Army, but the chief reason for this difference may 
well be the fact that the test group was studied fifteen years after the Army 
group. 

The various comparisons which have been made show that the test group 
is a good sample of the occupational distribution of the city of Philadelphia, 
that it has a mean Barr rating which is very close to that of the occupations 
considered to be representative of the man of average intelligence as indi- 
cated by Army Alpha score, and finally that, although its mean school grade 
is somewhat higher than the Army mean, it is nevertheless very similar to 
the large Army sample from the point of view of educational distribution. 
The findings are sufficient to indicate that it is a good sample of the middle 
levels of the population, and as such a valuable basis for the study of mental 
functioning in the adult period. 

* s The tat group, 31 noted above, contained five foreign-born subjects, but these were 
from English-speaking countria and nearer the general level of the native-white than of tbe 
foreign-bom population. ^ 

* a The statistics for the Army are derived from Table 302 of the Mem. Nat. Acad. Sa+ 

* 5 . 758 . 
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THE TESTS 

1. Basis for Selection 

2. Outline of Tests 

3. Examination Periods 

I. BASIS FOR SELECTION 

T HE tests were chosen to meet the problem of the aphasia study, that is, 
to establish and analyze lor the aphasic patients whatever changes 
might occur in language and thought processes. 1 The complexity of the 
changes in aphasia, as they had repeatedly been demonstrated in the litera- 
ture, made it necessary to study a tvidc variety of performances, ranging 
from the simplest speech functions to complicated processes of abstract 
thought. Only the simplest speech processes, such as repeating easy words 
and phrases or naming common objects, could be assumed to be entirely 
within the ability of all adults except the low-grade feebleminded and the 
mentally deteriorated. For all the more complicated performances, includ- 
ing responses on both intelligence tests and educational achievement tests, 
tentative adult norms had to be established before the aphasic performances 
could be evaluated. Consequently all the tests of the aphasic battery, with 
the exception of the simplest speech tests, were given to the normal group. 

1 The fact that the test! were selected primarily for the study of aphaiu resulted in the in- 
clusion of tests which would not ha.\c been chosen for a study of adult intelligence alone. 
{The battery Used with the flon-aphasic adults included the Stanford Bmet Scale, only a few 
separate tests of which were employed in the study of aphasic patients ) There were in- 
cluded, first of all, testa for specific processes, such it the particular reading abilities analyzed 
by the Gatei Tests of Visual Perception and Selection. Performances on these tests arc interest- 
ing in eases of aphasia where reading difficulties are pronounced but comparatively uninterest- 
ing in normal adults, and results of such tests tvill not be presented. Secondly, since the un- 
derstanding of spoken language and of printed material is sometimes unequally disturbed in 
eases of aphasia, both ora! and printed forms of several tests were arranged Both forms were 
given to the normal control group, and the results ate rryoned in th» study Thirdly, in view 
of the aphasic limitations us speaking and wnong, it was necessary to include lesti which 
would not be greatly influenced by pacticu'at handicaps in either of these methods of fe« 
iponse. There are two cases in point here first the use of both the Momion McCall Spelling 
Sale, a word dictation test, and the Cates Oral Spelling Test, and, second, the use of the 
Chapman Unspeeded Rea ding -Comprehension Test in addition to the Thorndike McCall 
Reading Scale, because reading comprehension on the former can be indicated without $ ver- 
bal response, simply by pointing to the incorrect word 
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While some few intelligence tests for adults exist, notably the Army 
Alpha and the various examinations for high school or college students, 
none was sufficiently comprehensive or sufficiently analytic for the aphasia 
study. In any case, all the tests for college students could immediately be 
eliminated as too difficult for the average adult. It was therefore necessary to 
construct the battery largely from the mental and educational achievement 
tests in use with children. Among these tests, an effort was made to choose 
those with content best adapted to adult interests. In their use with children 
most of these tests have been shown to be reasonably valid and reliable. 
For others no data have been presented. It must be noted, however, that the 
tests cannot be assumed to be valid and reliable for adults on the basis of 
results for children. 


2. OUTLINE OF TESTS 3 

In the following outline of tests and in the presentation of results in 
Chapter IV, the tests have been divided into three groups: Language Intel- 
ligence, Educational Achievement, and Non-Language. This division is a 
more or less artificial one, and there are many points of overlapping be- 
tween the groups, but it is a convenient practical arrangement. 

I. Language Intelligence Tests 

a. Controlled Association Tests 

1. Opposites Tests 

Oral Form: Whipple’s List IV 

Printed Form: new list constructed to approximate List IV u 
difficulty 

2. Mixed Analogies Tests 

Oral Form: constructed from Van Wagcnen’s Test B 
Printed Form: constructed from Van Wagenen’s Test A 

b. Kelley Trabuc Completion Exercise Beta 

c. Vocabulary Tests 

1. Oral: Stanford Binet Vocabulary Test 

2. Printed: Thorndike Test of Word Knowledge 

d. Stanford Binet Scale 

e. Miscellaneous Tests 

1. Absurdities Tests: Oral and Printed Forms constructed from a 

surd statements suggested by Ballard and others 

2. Pintner-Toops Revised Directions Test 

* For sources and for construction and content of new tests, see Appendix. 
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II. Educational Achievement Tests 

a. Reading Tests 

z. Gray Oral Reading Tests 

2. Thorndike McCall Reading Scale 

3. Chapman Unspeeded Reading-Comprehension Test 

b. Spelling Tests 

1. Gates Oral Spelling Test 

2. Morrison McCall Spelling Scale 

3. Stanford Achievement Dictation Test 

c. Writing a Letter 

d. Arithmetic Tests 

1. Stanford Achievement Computation Test 

a. Stanford Achievement Reasoning Test 

III. Non-Language Tests 

a. Tests from the Pintner Paterson Performance Scale 

1. Mare and Foal 

2. Sc gum 

3. Substitution 

b. Pintner Non-Language Mental Test 

c. Cancellation 

d. Immediate Recall: Kuhlmann Binet 

e. Drawing a Chair 

f. Goodenough Drawing Scale 

g. Portcus Maze Test 

3. EXAMINATION PERIODS 
Ten to fifteen hours were required to give the complete battery of tests. 
Since the normal working period was generally 3 half hour to an hour, 
with one and sometimes two periods a day, the examination required ten 
days to two weeks to complete. A number of patients were discharged be- 
fore all the tests had been given, but since they were not given in a definite 
order all of the tallies are less than the total number of cases in the group. 3 

* Although no regular order was set up lor all the tests, some particular rules were ob- 
served: where both oral and printed forms of a test were being used, the oral was given first; 
and the Pintner Non-Language Test was given before the Digit-Symbol Substitution of the 
Pintner Paterson Scale. 
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distributions arc shown in Tabic XII. The total distribution on this test is 
one of those most nearly approximating the normal curve. 

Both vocabulary tests, the Stanford Binet and the Thorndike Test of 
Word Knowledge, were easy to administer and fairly satisfactory in dis- 


TABLE XI 

RANGE OF SCORES, MEAN SCORES, AND SIGMAS ON ALL TESTS OF 
THE BATTERY 

N XANCE MEAN SICMA 


I. LANGUAGE INTELLIGENCE TESTS 

Oral Opposites: Score . . 65 

Time . - 65 

Printed Opposites: Score . 51 

nine . 51 

Oral Analogies .... 41 

. Printed Analogies ... 60 

Kelley Trabue Completion 

Beta 57 

Stanford Binet Vocabulary . 69 

Thorndike Word Knowledge 54 

Stanford Binet Scale ... 65 

Ora! Absurdities .... 44 

Primed Absurdities ... 41 

Pintner-Toops Directions . 62 

It. EDUCATIONAL ACHIEVEMENT TESTS 
Gray Oral Reading Para- 
graphs, Set IV ... . 49 

Thorndike McCall Reading 59 

Chapman Reading-Compre- 
hension 42 

Gates Oral Spelling . 53 

Morrison McCall Spelling . 59 

Stanford Achievement Dicta- 
tion .... -5® 

Writing a Letter . . 36 

Stanford Arithmetic Compu- 
tation .... 55 

Stanford Arithmetic Reason- 
ing . . - - • * 54 


9-20 

17.82:0.3 

242:0a 

1 4 1-37 

71.022.9 

23 - 5 * 2.1 

9-20 

16.2210.4 

3.12:0.3 

130-30 

74 - 3*4 9 

34 - 5 * 3-4 

6-31 

20 6 2:i.o 

6.52:0.7 

7-34 

21 .02:0.9 

6.720.6 

4.0-10.2 

7-82:0 2 

1.32:0.1 

25-86 

54 - 5 * »-9 

15.62:1.3 

18-98 

61.822.8 

2042:2.0 

96-134 

163.623.1 

25.02:2a 

0-10 

6.r ±0-4 

2.42:0.3 

I-IO 

7.12:0.3 

2.22:0.2 

5-27 

15 62:0.6 

4.82:04 


129-40 

• 4-33 

73 - 7*3 0 

24 72:0 6 

20.622.1 
4 62:04 

3-39 

9-36 

9-49 

16421.0 

26.121-0 

3982:1.1 

6 62:0.7 
6.92:0.7 
8.32:0.8 

32-121 

2.8-80 

90 62:2.8 
54 *o -2 

21.12:2.0 

1.220 1 

i °-45 

264 ±r,i 

8.320.8 

8-38 

22.820 7 

54205 


Noth: Each mean and sigma is followed by its standard error. 
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TABLE XI (coni.) 



N 

XANCE 

MEAN 

SIGMA 

III. NON-LANGUAGE TESTS 

Pintner Paterson 

Marc and Foal: Tune . . 

50 

52-8 

28.6^1.3 

io-5±t.i 

Srguin 

46 

=*-9 

I5-5^=0-9 

6 ^dzo£ 

Substitution .... 

45 

3 2 5-50 

I04.8;il6.5 

44- 1 — 4-4 

Pintner Non-Language . 

5 b 

80-536 

356-}— >4-4 

107.8^10.2 

Imitation 

5b 

0-12 

6-3±°-4 

2.72:0.3 

Easy Learning .... 

56 

8-5O 

37-8 -='-5 

io.<j:£r.o 

Hard Learning . . - 

5b 

£-« 

33&J.3 

9^drO-9 

Drawing Completion . . 

5*> 

6-20 

T 3-4— °-4 

3-3— °-3 

Reversed Drawing . . . 

5b 

0-9 

5-5— 0.3 

20±02 

Picture Reconstruction 

5^ 

2-28 

1 2J :±; 0.7 

5-4 ±o-4 

Cancellation 

<6 

5-34 

19-6— 1-° 

6.82:0.7 

Kuhlmann Immediate Recall 

44 

433-57 

173-0^:10.7 

70-7 ±7-5 

Drawing a Chair .... 

27 

6-26 

*5-7— 1-° 

5:2 it 0.8 

Goodenough Drawing . . 

53 

4-50 

2S2±I.5 

I0.62:*.0 

Porteus Ma2e 

45 

48-216 

i 69-5±5-5 

34«=M7 


Nora: Each mean and sigma is foUoured by its standard error . . 


c riminatin g the adult subjects. The distributions are presented in Tables 
XIII and XIV. The Stanford Binct, as is usually the case, furnished an ex- 
cellent pr eliminar y index of the patient’s level together with various clues 
as to bis cultural development, his intellectual reactions, and his interests 
and attitudes. Both tests were found to be fairly reliable. The correlation 
between the two lists of the Stanford Binet was +-95 with a standard error 
of J 32 for 41 cases- 3 The correlation between odd and even items of the 
Thorndike Test of Word Knowledge was +.94 ±sn for 52 cases. 

The distributions of mental ages on the Stanford Binet Scale appear in 
Table XV. 4 Mental ages for these adult subjects are to be understood in a 

•It is interesting to note that the nan store* on the two lists were almost the same: tin 1. 
27.76 and Urt 2, 26.34. . 

4 The procedure in giving the Stanford Binet Scale was to indude die tests from the X-jni 
level through the par where all tests were filled. Tests below the X-par level were p **n 
when necessary to establish a basal age. 

In 4 of the 65 cases only the starred tests were given. 

The correlation bnween the scores on the complete scale and the scores on the abbreviated 
scale of started tests is +.97 for 61 eases. The mean score for the complete scale was 

163.6, for the starred scale 165 4. _ 

Since "scatter" of Stanford Binet scores is usually considered an important index in do-Wg 
with psjchcitjcs, it was thought interesting to compute the scatter for the normal adults of this 
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TABLE XII 

DISTRIBUTION OF SCORES ON THE KELLEY TRABUE COMPLETION 


EXERCISE BETA 


scour 

MSU 

nmu 

TOTAL 

io,o-io -^9 

a 


a 

9-5- 999 


1 

2 

9 9H9 

5 

a 

7 

8-5- S-99 

6 

1 

7 

8 o- 8 _j 9 

6 

6 

21 

7-5- 7-99 

7 

3 

20 

7 . 0 - 7-19 

5 

1 

6 

6 . 5 - 6.99 


3 

3 

60 - 6.49 

4 

1 

5 

5-5- 5-99 


1 

2 

5-0- 5-49 

1 


2 

4-5- 4 99 

4 4-49 

1 

1 

a 

Number 

37 

20 

57 

Mean 

7% 

7.62 

7.80 

Sigma 

1.29 

1^9 

I * 2 7 


TABLE XIII 

DISTRIBUTION OF SCORES ON THE STANFORD El NET VOCABULARY TEST 


SCO*E 

MALI 

FEMALE 

TOTAL 

84-89 

a 


2 

78-83 

2 

1 

3 

7^-77 

2 

2 

4 

66-71 

3 

3 

6 

60-65 

7 

1 


54-59 

9 

4 

n 

48-53 

8 

2 


42-47 

7 

3 

10 


3 

3 


30-35 

2 



=4-29 

1 


1 



23 

69 

Mean 

55-30 

52.83 

54-49 

Sigma 

i3°5 

15.10 

*5-5 9 
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limited sense, not as units of development corresponding to chronological 
age, but simply as scores. The highest possible Stanford Binct score is 334. 
Only one individual attained this maximum and only 4 others reached or 
exceeded 210. There may well be an end error in these 5 scores. Further- 
more, there may possibly be an end error in the scores of the 15 other cases 
passing one or more tests at the XVIII-year level. A glance at the distribu- 
tion, however, shows no piling up of scores in the higher steps. 


TABLE XIV 

DISTRIBUTION OF SCORES ON THE THORNDIKE TEST OF WORD KNOWLEDGE 


SCORE 

MALE 

FEMALE 

TOTAL 

9I-IOO 

2 

2 

4 

81-9O 

3 

r 

4 

71-80 

9 

4 

*3 

61-70 

6 

5 

ii 

51-60 

5 


5 

41-50 

4 

2 

6 

31-40 

2 

4 

6 

21-30 

1 

1 

2 

11-20 

3 


3 

Number 

35 

19 

54 

Mean 


62.21 

6r.7$ 

Sigma 

21m 

20-47 

20.40 


Despite all the justifiable criticisms which have been made of the Stan- 
ford Binet Scale, particularly for use with adults, it stands up well under 
analysis: inspection of the percentages passing each sub-test, as shown in 
Table XVI, indicates that few of the tests are badly misplaced. 5 For the 

group. The measure used was Woodworth's spoiled SJX (R. S. Woodworth, Ta^de for Find- 
ing the SJ3. of the Mental Age Distribution, New York, 1917: Confuting the Standard 
Deviations and Probable Error of a Binet Mental Age, Psychol. Bull., igaS. 25* j* 

should be recognized that as the basal age becomes higher, the possible size of the S-D U 
progressively lessened and no correction for this change is psssxbV. For the normal adults, the 
figures are: 

Mean SJ>. Sigma N , 

Men 3 X *74 >*■>* < 2 

Women 31.26 10,32 33 

The measure of scatter plotted against Stanford Binet score gives a triangular-shaped &«n- 
bution probably respiting in large part from the effort mentioned above of the raising of the 
basal age. 

8 Several of the tests require particular comment- The weights, IX 2, used at the Graduate 



LANGUAGE INTELLIGENCE TESTS 59 

TABLE XV 


DISTRIBUTION OF SCORES OiV THE STANFORD BINET SCALE 


SCORE 

MALE 

FEMALE 

TOTAL 

220-234 

205-219 

X 

4 

2 

6 

190-204 

4 

1 

5 

175-189 

6 

2 

7 

160-174 

XX 

5 

16 

145-159 

9 

6 

15 

I30-I44 

5 

5 

10 

1 15-129 

2 

1 

3 

IOO-IX4 

85-99 

Number 

4 2 

X 

1 

23 

X 

1 

63 

Mean 

167.9 

155.8 

163.6 

Sigma 

26.0 

27.9 

25.0 


adults examined, the order of the tests from the most to the least difficult 
'vas as follows: 6 


u XVIII 

fc 

4 - 

5 - 


6 

4 

5 

* 


Ingenuity 
Reproduces passage 
7 digits backwards 
Vocabulary (75) 
Paper cutting 


and Philadelphia Central HospitaU, were discovered, after mu,h of the testing had been done, 
» have developed inaccuracies. Since they may have been inaccurate for some time all the 
resell, vdiith lid previously been *«= discarded l*' /auk “ ■» “* ■ b =« : 

0011.0= i. based on He records Sou. » cares =»'y. J< “ *= Unhoped* and lie balance St He 
Philadelphia General Hospital. Since she Onbopedie group .. ,pnerally superior ro lie oHee 
I»o. the* relause d.Beult, cl tb. rert doer not depend on an overere.ghung of lour-grade cases. 

No rank sva, assumed to tie Code Te.r. XVI 6 This »as one or He few ten. for ududl 
administrative ddSculSe, scere encountered In rath hupual man, of ^ He refused to 

alrempt It a. all, odiers .railed in a half headed tarn, but accomp .sired very l.rtle. The res. 
-a. not only unsanitary a. au mden of orleltrgeoee fo, He* adults, bu, „ was so distaste- 
ful to man, of rhem Hat !, war eventually dropped from He sires, and reflated lbs purpose, 

Wiring by an alternate. second le* whieli presented some administrative diS- 

Tbe Free Assoc, atron, XI.™ ‘ „ -„„b!e Hree He adult, mere 

decidedly more reifLisoous Han children Some o! Hem complsmrf He «It made 
Hem "feel foolish.” (Obreev.non. of lebavior sremed to mdieate Ha. He ,e„ wa, someone, 
an mde. of emotional nab, hr, .) In.ere.ongly enoogh. He Negioe. appeared be more eon,, 
fomblc to His test urinition fiaO a bb r eviatei Stanford Eincr Scale are ranked in order of 
dddX for^v”* m“le*V sundae r.ukmg of reds for He white male, .r H„ group gives a 
rank correlation of +.86 with the Army * l5t - 
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6. 

xvr 

Al.x 

28 syllables 

7 • 

XVIII 3 

8 digits Ibnvards 

8. 

XVI 

I 

Vocabulary (65) 

9 - 


5 

6 digits backwards 

10. 


2 

Fables 

in. 


4 

Enclosed boxes 

12. 


3 

Difference between abstract words 

13 * 

XIV 

6 

Clock 

14. 


Al.i 

7 digits forwards 

15 * 

XII 

3 

Ball and field 

l6. 


6 

5 digits backwards 

( 17 . 

XIV 

5 

Arithmetic reasoning 

[l8. 

X 

6 

60 words 

19 . 

XII 

4 

Dissected sentences 

20. 

XIV 

1 

Vocabulary (50) 

21. 

X 

3 

Designs 

22. 

XIV 

3 

President and king 

2 3- 


4 

Problems of fact 

24. 

XII 

.5 

Fables 

25. 


8 

Similarities 

26. 

XIV 

2 

Induction 

2 7 • 

X 

4 

Reading and report 

28. 

XII 

7 

Pictures, interpretation 

29. 

X 

Al.2 

20-22 syllables 

3 °* 

xn 

1 • 

Vocabulary (40) 

3 r - 

IX 

6 

Rhymes 

{ 3 2 - 

X 

Al.i 

6 digits forwards 

( 33 - 

IX 

2 

Weights 

34 - 


4 

4 digits backwards 

135 - 

XII 

2 

Abstract words 

' 3 6 - 

X 

2 

Absurdities 

[ 37 - 

IX 

5 

3 words in sentence 

( 38 - 

VIII 

1 

Ball and field 

39 - 

IX 

1 

Date 

40. 

X 

1 

Vocabulary (30) 

41. 


5 

Comprehension 

42. 

IX 

3 

Makes change 

43 - 

VIII 

2 

Counts 20-1 

44. 


4 

Similarities (2) 

45 - 


5 

Definitions 

46. 

VII 

3 

5 digits forwards 

47 - 


6 

Copies diamond 

48. 


Al.2 

3 digits backwards 

49 - 

VI 

2 

Mutilated pictures 
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TABLE XVI 

PERCENTAGE OF THE TEST CROUP PASSING EACH ITEM OF THE 
STANFORD Ft NET SCALE 


TOT 

M 

r 

TOTAL 

TUT 

M 

r 

TOTAL 





T tAJL Jt (tout ) 




I. Right and le ft 

3. Mutilated picture* 

too 

too 

100 

Al.l. 6 digit* forward* 

90 

87 

89 

too 

55 

98 

AIj. ao-aa ijllablfi 

8a 

86 

84 

3. Count* ty pennies 

4. CanjTtSfaiion 

too 

100 

z 

mi* ill 




5. Namesroin* 

100 

too 

ICO 

1. Vocabulary (40) 

90 

75 

Sj 

6. ifr-lS lyllablr* 

too 

100 

ICO 

3. Abstract wordi 

97 

87 

94 





j. Ball and fie! J 

60 

48 


ti 1* vti 




4. Diisetted sentence* 

62 



1. Ftngetx 

too 

too 

too 

5. r»Ne» 


74 

7* 

3. Picture*, dcicrip- 




6 . 5 digit* backward* 

64 

48 

59 

trwi 

too 

100 

100 

7. Pieturr*, Interpre- 




3. 5 digit* forward* 

too 

9 ,J 

sS 

tation 


;S 

7* 

4. Ties Vow knot 

too 

too 

ICO 

8. Similarities (3) 

75 

70 

7* 

5. Difference* 

IDO 

too 

ICO 

6. Coptr* diamond 

100 

V« 

98 

Tin air 




A! 1. Pay* of week 

IDO 

IDO 

I DO 

1. Vocabu'ary (jo) 

«9 


64 

Al-J. 3 digit* backward* 

too 

96 

98 

3. Induct ion 

85 

61 






3. Pretident tnd king 

7* 

48 

66 

TIA* V7T! 




4. Problem* of /art 

69 

61 

66 

». Pa't and field 

100 

«7 

95 

5. Arithmetic reason- 




3. Count* 30-1 

100 

S C 

95 


7* 

44 

Ci 

J. Comprehendo-i 

too 

too 

too 

6 . Clock 

55 

39 


4 . Similarities (3) 

100 

9« 

9* 

Al.i. 7 digit* forward* 

64 

39 

35 

5. Definition* 

6. Vocabulary <3o) 

ICO 

9C 

98 

t Vocabulary {63) 

16 

39 

28 

Tt»* II 




3 Fable* 

JS 

35 

37 

I. Date 



97 

3 Difference between 





9« 

86 

89 

abstract word* 

36 

44 

38 

3. Make* thin gt 

4. 4 digit* backward* 

97 

*00 

98 

4 Etv: loved boxet 

4» 

26 

37 

97 

87 

93 

3 6 digit* backward* 

3« 

36 

3 9 

3. 3 word* In sentence 

97 

9f 

95 

6 Code 

ao 

- 

'5 

6. Rhyme* 

*5 

86 

85 

A) 1. aS lyllable* 

3 J 

*3 

a 3 

Abl. Month* 

too 

too 

too 

tHi *vm 




nuj 




1 Vocabulary (73) 

»4 

13 

1 4 

1. Vocabulary (30) 

97 

100 

98 

3 Paper cutting 

30 

32 

30 

3. Abiufdstie* 

95 

91 

94 

3. 8 digit* forward* 

3* 

17 

36 



64 

64 

4 Reproduce* passage 

10 

9 

9 

4. Reading and report 

: 74 

81 

77 

j. 7 digit* backward* 

14 

0 

9 

3, Comprehcniion 

97 

loo 

98 

6. Ingenuity 

10 

c 

6 

6. Co werdi 

61 


6 l 
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TEST RESULTS 


All the other tests were passed by each member of the group who fell 
sufficiently low to have been given them. The mean score for repeating 
digits forward was 6.69 =fc. 12 with a sigma of 1.02 dz. 09, and the mean score 
for repeating digits backwards 4.87 dz. 14 with a sigma of 1.16 rt.io. An end 
error was present in the distribution of the former, but not in that of the 
latter. 

Neither of the Absurdities Tests is adequate in the present form. Of the 
two the Oral gave the better distribution. The construction of a longer ab- 
surdities test would probably be well worth while, for the detection of ab- 
surdities seems to be an excellent test problem for adults. 7 

The Pintner-Toops Revised Directions Test proved a convenient intro- 
duction to longer printed tests, and the total distribution of scores was sat- 
isfactory, but the diversity of the activities required by the various items 
made it difficult to interpret the total score. 

2 . EDUCATIONAL ACHIEVEMENT TESTS 

The scores on the Cray Oral Reading Test are combined time and error 
scores. 8 The results reported in the second section of Table XI are based 
on only those subjects able to attempt reading as difficult as that of Set IV, 
that is, on 57 of the 68 subjects to whom the oral reading test was given, 
and the mean of 73.7 is consequently slightly lower than a mean for the total 
group would be. The distribution shows a marked negative skew. 

The two reading comprehension tests, the Thorndike McCall Reading 
Scale and the Chapman Unspeeded Reading-Comprehension Test, were 
both fairly satisfactory for the adults examined. The first had a slight advan- 
tage from the point of view of the regularity of the distribution, which J* 
presented in Table XVII.® The second, however, is a little better adapted 
to adults from the point of view of content. 

The distributions for the three spelling tests all showed more or less nega- 
tive skew. Those for the Gates Ora! Spelling and the Stanford Achievement 
Dictation Test showed much less skew than that for the Morrison McCall » 

r The combined score* on the Oral and Printed Absurd! ties Tests correlated Curly highly 
with the scores on the Stanford Binet Scale- +.75 with a standard error of .07 for 40 f** 3 - 

* Analyses of the errors made on the Gray Oral Reading Tests, Sets II, III, and IV, snowed. 

55 per cent mispronunciations, 11 per cent repetitions, ja per cent omission,, and tl per ccn< 
insertions. _ 

* The Chapman Test was unfortunately given to only 43 cases, and this small number may 
account for some irregularities in the distribution. 
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TABLE XVII 


DISTRIBUTION OF SCORES ON THE THORNDIKE MCCALL READING SCALE 


SCORE 

MALE 

FEMALE 

TOTAL 

31-33 

4 

I 

5 

28-30 

8 

6 

t4 

25-37 

9 

1 

10 

22-34 

8 

8 

16 

X9-iI 

4 

2 

6 

lf>-l8 

4 

3 

7 

*3-i5 

Number 

z 

38 

21 

59 

Mean 

24.92 

24.24 

24 68 

I 

co 

4-75 

4.4O 

463 


The Gates Test was quickly administered and readily accepted by adult 
subjects. The correlation of alternate items is £6 dr.04. For the 53 adults 
examined the order of difficulty of the words is as follows: 


it 

13. built 

25. arrangement 

but 

(14. travel 

26. approaches 

are 

(15. factory 

27. necessary 

day 

16. marriage 

28. restaurant 

nine 

17. afraid 

29. difficulty 

card 

f:8. visitor 

30. acquaintance 

me 

1 1 g. measure 

J31. magnificent 

do 

20. information 

{32. extraordinary 

mail 

21. circular 

33. miscellaneous 

catch 

(22. amusement 

34. architecture 

teach 

(23. estimate 

35. conscientious 

prison 

24. elaborate 

36. hippopotamus 


The Morrison McCall Spelling Scale is not sufficiently difficult for adults 
from average or near-average levels of the population, 10 but the Stanford 
Dictation Test is fairly satisfactory from the point of view of its difficulty. 
A certain amount of negative skewness, such as appeared on both the Gates 
and the Stanford Tests may be characteristic of the scores for adult subjects 
no matter what the content of a legitimate spelling test, for a large number 
of the words which may justifiably be included, without making the test a 


14 The Mornson McCall Scale is satisfactory for low-grade subjects and is useful iq patio- 
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TABLE XVIII 


DISTRIBUTION OF SCORES ON THE STANFORD ACHIEVEMENT ARITHMETIC 
COMPUTATION TEST 


SCORE 

MALE 

FEMALE 

TOTAL 

45-4S 


1 

Z 

41-44 

2 


2 

37-4° 

2 


2 

■ 33-36 

4 

2 

6 

29-32 

5 

1 

6 

25-28 

ix 

5 

1 6 

21-24 

6 

6 

12 

17-20 

6 

2 

8 

13-16 

r 


1 

9-12 


1 

1 

Number 

37 

18 

55 

Mean 

26.92 

25-33 

26.40 

Sigma 

6.59 

7-35 

8.31 


stunt performance, are within the ability of many adults. In other words, 
there is reason to believe that spelling is an activity which is fairly ade- 
quately acquired at adult levels by persons of moderate as well as persons 
of higher intelligence and education. 


logical conditions where sentence dictation tests are impossible. The correlation of alternate 
items is +.94 ± 02. 

The words of List 1 fall in the following order of difficulty: 


2. top 

3- red 

4- book 

5- play 


f 9- lay 
10. add 


”■ T* 

13. body 

14. tired 

15. alike 

16. shut 

17. reason 


18. perfect 

19. nearly 

( 20. fifth 

22. led 
23. folks 
24. anyway 
25. desire 
2 6. therefore 
27. written 
J28. omit 
•J29- arrange 
I30. popular^ 
31. celebration 
[3 2. search 
4 33. interest 
l34. height 


35. reference 

36. diride 

37. minute 

38. pleasant 

39. necessary 

40. annual 

42. character 

43. separate 

44. committee 

45. immense 

46. acquaintance 

47. discipline 

48. prinaple 

49. judgment 

50. lieutenant 
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Scores for the test of Writing a Letter arc scale values of the Nassau 
County Supplement to the Hillegas Scale, a scaling system which is prob- 
ably less adequate for the productions of adults than for those of children. 
The writing of letters was a task which some of the adults did not accept 
readily, but which would be more generally acceptable if a satisfactory pre- 
text for the letter were arranged. 

The distribution of scores on the Stanford Achievement Arithmetic 
Confutation and Reasoning Testt, which are given in Tables XVIII and 
XIX, are both good. The tests are apparently quite adequate for adults of 
the status of those examined. 


TABLE XIX 

DISTRIBUTION OF SCORES ON T1IE STANFORD ACHIEVEMENT ARITHMETIC 
REASONING TEST 


scour 

MVLE 

FEMALE 

TOTSL 

3S-39 

t 


1 

32-35 

3 

2 

5 

aS~3t 

6 


6 

2-1-37 

6 

1 

7 

20-23 

12 

5 

i 7 

1&-19 

6 

7 

13 

12-15 

3 

1 

A 

8-1 1 


1 

1 

Number 

37 

>7 

54 

Mean 

ajSS 

20-47 

22.80 

Sigma 

5-97 

6.05 

5-39 


3 . NON-LANGUAGE TESTS 

The large majority of the tests of the Pintner Paterson Performance 
Scale are far too easy for adults from average or near-average levels of the 
population, A few of them, the Mare and Foal (time score), the Segu'tn, 
and the Substitution, were moderately discriminative for the adults ex- 
amined; and the ranges of scores, means, and sigmas for these have been 
presented in the third section of Table XI. None of these tests is particu- 
larly valuable in studying the normal adult nor interesting to him, but sev- 
eral of them are valuable in studying low-grade or deteriorated cases. 
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TABLE XX 


DISTRIBUTION OF SCORES ON THE PJNTNER NON-LANGUAGE MENTAL TEST 


SCORE 


Ff MALE 

TOTAL 

5=5-575 

1 


I 

475-52-1 

7 

I 

8 

425-474 

4 

3 

7 

375'424 

5 

3 

8 

325-374 

8 

5 

23 

275-324 

3 

5 

8 

225-274 

7 


7 

175-224 

1 


1 

1 25-174 

1 


1 

75-124 


3 

2 

Number 

37 

>9 

5 6 

Mean 

3 6 5-7S 

337-68 

356-43 

Sigma 

IIO.67 

98.26 

107.80 


TABLE XXI 

DISTRIBUTION OF SCORES ON THE FORTEUS MAZE TEST 


SCORE 

MALE 

FEMALE 

TOTAL 

216-327 

X 


1 

2O4-215 

3 

4 

7 

I92-203 

3 

1 

3 

180-191 

6 

3 

9 

168-179 

7 

2 

9 

I5&-167 

3 

4 

7 

I44-J55 

> 

2 

3 

>32-143 


2 

2 

130-131 


1 

X 

I08-U9 

1 


X 

96-IO7 

84-95 


> 

1 

72-83 

60-71 

48-53 


* 

I 

Number 

24 

21 

45 

Mean 

>77-75 

160 00 

16947 

Sigma 

21-53 

39.28 

34.60 
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Tlie rinincr Non-Language Test is an interesting one; it presents mate- 
rial and requires performances not involved in the majority of other tests, 
but any exact determination of the abilities required by the various sub- 
tests for adult subjects or the reliability of the estimates is impossible. The 
distributions of total scores arc given in Table XX- The distributions of 
scores on all the subtests but the Easy Learning were fairly satisfactory. 
Experience with the adult subjects showed that they often had difficulty 
understanding the problem of the Reversed Drawing Test. 

The Cancellation Test was included as a check on the results of the more 
complex tests. The distributions were positively skewed. 

The scores for the Kuhlmann Test of Immediate Recall are based on 
both rime and errors, with the higher scores indicating the poorer perform- 
ances. The distributions were good. Attempts to determine the patient’s 
method on this test were comparatively fruitless. Observations of behavior 
and introspections indicated at least three types of attack: visual, {an- 
aesthetic, and verbal. The first two appeared most frequently. Apparently 
more than one of the three types was often used by the same subject. 

The test of Drawing a Chair from a model was moderately discrimina- 
tive, but it >s not a test which adults attempt readily, and a few refused to 
try it a: all. 

The adults examined often showed a reluctance to attempt the Good- 
enough Drainng Test also. The scores distributed well, but it is highly 
probable that the}' are not at all comparable to similar scores obtained by 
children. 

The Forieus Maze Test, unlike the drawing tests, was well received by 
practically all subjects; it is apparently as interesting to adults as to chil- 
dren. The distributions are shown in Table XXI. 15 

If if to be noted that till ten. which WJI iJJtO to harrry about a war after the be- 
ginning of the study, happened to h: given to a rdirivtjy snuEer proportion of the teen and 
a relatively larger proportion of the vreenen than any other test. This situation does not account 
for the {airly large sex difference in favor of the men, bowerer. became the ma omitted were 
nonlj from the Orthopedic Ilatottal, that is. the group which tended to gw the highest 
scores. 



V 

COMPARATIVE FINDINGS FOR MEN AND WOMEN 


1. Mean Scores 

2. Variabilities 

I. WEAN SCORES 

T HE group of men, containing 47 cases, is about twice the size of the 
group of women, and this proportion usually holds for the scores on 
any test. The two groups are similar in age and in educational and occupa- 
tional status. The mean age lor the men is 35 years and the mean for the 
women 3S. Each group falls at the eighth grade in educational level. The 
mean occupational rating on the Barr Scale lor the men is 8.01, for the 
women 8.77. 1 

The most striking fact evident from a comparison of scores for men and 
women is the closeness of the means for the two groups on the large ma- 
jority of tests. As inspection of Table XXII will show, some of the mean 
scores are practically identical. The slight disparities shown on others prob- 
ably result from fluctuations of sampling. It is noteworthy, however, that 
the means for the men are superior to those for the women on all but three 
of the tests included in the table. None of the differences is statistically sig- 
nificant. Those most nearly approaching significance appear for the Draw- 
ing Completion Test of the Pintncr Non-Language Scale, the Stanford 
Binet, the Kuhlmann Immediate Recall, and the Drawing of a Chair, with 
differences divided by the sigmas of the differences of 2.43, 1.72, x-^g, and 
i-49 respectively. It is particularly interesting that no significant sex differ- 
ences in means were found on either verbal or numerical tests, for such dtfi 
ferences have often been reported in the literature, especially in studies of 
children and adolescents. 2 

1 The slightly higher mean for women on the Barr Scale is not an altogether satisfactory 
criterion because the figure may be artificially raised by the rating of 8 37 given the house- 
wives. There is no reason to suppose, however, that the mean educational level for women is 
not so accurate as that for men. and the equality of the two groups in this regard justifies direct 
comparison 

* It is true, however, that the three tests in which the mean score for women svas slightly 
superior to that for men— the Oral Analogies, the Thorndike Word Knowledge, and the Let- 
ters — were all verbal tests, and that four of the five sub tests of the Stanford Binet in which the 

women excelled the men Reading and Free Association at Year X, Dissected Sentences at 

Year XII, and Differences between Abstract Words at Year XVI— were also verbal. 



TABLE XXIII 


AGE DECADES TWENTY THROUGH FIFTY 


73 



Pintner Non-Language 
Kuhlmann Immediate Recall 
Porteus Maze 
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2. VARIABILITIES 

Because of the small groups there is little decisive evidence on the ques- 
tion of differences in variability. The sigmas shown in Table XXII, like 
the means, are in most eases close. Among the few fairly large differences 
in absolute variability those appearing for the Substitution, the Kuhlmann 
Immediate Recall, and the Portcus Maze Test each depend on an extreme 
case, but that for the Stanford Dictation Test reflects the more platykurtic 


TABLE XXII - 

MEAN SCORES AND SICMAS FOR MALES AND FEMALES 


TEST 

MEAN 

Hale female 

SIGMA 

Hale Female 

Printed Opposites: Time . . . 

71.O 

79.6 

39-3 

22.0 

Oral Analogies 

20.0 

212 

6.7 

6-3 

Printed Analogies 

2 M 

20.3 

7.0 

6.1 

Completion Beta 

7-9 

7.6 

*■3 

i-3 

Stanford Binct Vocabulary . . 

55*3 

S=-8 

13.1 

15.1 

Thorndike Word Knowledge . . 

61.5 

622 

2 M 

20.5 

Stanford Binct Scale .... 

167.9 

155-8 

26.0 

27.9 

Gray Oral Reading, Set IV . . 

72.7 

75.6 

19.0 

23.0 

Thorndike McCall Reading . 

24-9 

242 

4.8 

4-4 

Chapman Reading-Comprehension 

16.8 

15.8 

6 -3 

72 

Gates Oral Spelling . . ; . 

26.7 

35.1 

7-3 


Stanford Dictation ..... 

91.0 

89.8 

239 

i9-5 

Writing a Letter ..... 

5 -* 

5.6 

12 


Arithmetic Computation . 

26.9 

25*3 

7-° 

7-4 

Arithmetic Reasoning .... 

23.9 

20.5 

6.0 


Seguin 

15*3 

j6.o 

4-4 

33 

Substitution 

99.9 

114-7 

29.1 


Pintncr Non-Language .... 

365.8 

337-7 

1 1 0.7 

98.3 

Imitation 

6.8 

55 

29 

22 

Hard Learning ..... 

34.1 

3 2 -7 

9-3 

10.9 

Drawing Completion 

14.1 

12.0 

3° 

3-2 

Reversed Drawing .... 

5*7 

5-1 

2.0 


Picture Reconstruction . 


ir-7 

3-8 

7-5 

Kuhlmann Immediate Recall . . 


196.0 

58.3 

83.6 

Drawing a Chair ..... 

16.8 

14.0 

5-5 

4-3 

Goodenough Drawing .... 

182 

282 

10.4 

11.0 

Porteus Maze 

177.8 

1600 

21.5 

39-3 
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distribution of scores for the men as compared with those for the women. 
Expressed in terms of Pearson’s Coefficient of Variation, the relative vari- 
abilities for the remaining tests showing large differences in absolute vari- 
ability are as follows: 

Males Females 


Printed Opposites: Time . . 56.1 28.9 

Gray Oral Reading, Set IV 26.1 30.4 

Pintner Non-Language 30.3 29.1 

Picture Reconstruction 28.6 64.1 


Thus it is evident that the men are relatively much more variable than 
the women in speed on the Printed Opposites Test, and slightly more vari- 
able on the Stanford Dictation Test, while the women are much more vari- 
able than the men on the Picture Reconstruction Test. 

Findings for men and women with regard to inequalities in the perform- 
ance level in any individual are reported in Chapter VII. For the sake of 
completeness, it is to be noted here that there is no appreciable difference 
between the group of men and the group of women in mean or sigma of 
die index taken to represent individual inequalities m performance level. 
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73 



Pintner Non-Language . 
Ktihlmann Immediate Recall 
Portcuj Maze 
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cnee of the subjects in the twenties. The close similarity of all four decades 
from the point of view of educational and occupational status indicates that 
they may justly be compared for test performances, and that differences in 
test performances may be attributed to differences in age . 5 

Calculation of the P values for the differences between the mean test 
scores reported in Tabic XXIII shows no significant differences between 
means at any one-decade or two-decade interval. For the three-decade in- 
terval from twenty to fifty, however, there is a significant rise in the case of 
the Thorndike Test of Word Knowledge and there are significant declines 
in the case of the Printed Analogies and the Seguin Form board. 

From inspection of Table XXIII it will be evident that, while the differ- 
ences in means are in most cases comparatively slight, there is a * cry inter- 
esting division of the tests according to their course with age. All the lan- 
guage tests, with the exception of the Printed Analogies, show higher 
means in the thirties than in the twenties; while the non-language test*, 
with the exception of the Kuhlmann Immediate Recall, show no such 
rise.* Their means tend rather to decline steadily from the twenties 
through the fifties, and the Primed Analogies is like the non language tests 
in this respect. The means for the language tests all decline from the thirties 
to the forties, but there is not always a further decline from the forties to the 
fifties, and in the case of the Thorndike Word Knowledge and the Stan- 
ford Dictation Tests there are rises from the forties to the fifties. On the 
Thorndike Test of Word Knowledge the mean score in the fifties is higher 
than that in anv other decade.* 
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2 . FINDINGS FOR THE AGE GROUPS TWENTY THROUGH THIRTY-FOUR AND 
FORTY-FIVE THROUGH FIFTY-NINE 


When two larger age groups are selected for comparison, the one com- 
posed of all subjects aged twenty through thirty-four and the other of all 
subjects aged forty-five through fifty-nine, the mean score for the older 
group is significantly lower than that for the younger in the case of lour 
tests: the Printed Analogies, the Seguin, the Substitution, and the Pintner 
Non-Language. The comparative findings are shown in Table XXIV. In 
no case is the mean score for the older group significantly higher than that 
lor the younger. On both vocabulary tests, however, the means for the 
older group are slightly higher than those for the younger, and on lour 
other language tests — the Completion Beta, the Thorndike McCall Read- 
ing, the Gates Oral Spelling, and the Stanford Dictation— and on both 
arithmetic tests, the means for the older fall only slightly below those lor the 
younger. 

Differences in variability between the younger and older groups are on 
the whole small. In 10 of 17 cases the older group is apparently the more 
variable. The quartile deviations for the two groups on the tests of the pre- 


ceding analysis are as follows: 

Younger group 

Older group 

Oral Analogies 

.... 3° 

6.3 

primed Analogies ... . . 

• 4-5 

5-3 

Completion Beta 

.8 

1. 1 

Stanford Binet Vocabulary 

. 7.8 

ri.8 

Thorndike Word Knowledge 

15.8 

r 3 .o 

Stanford Binet 

12 8 

23.3 

Thorndike McCall Reading . 

4-5 

3-3 

Gates Oral Spelling 

3-3 

5.8 

Stanford Dictation . 

20.1 

16.8 

Arithmetic Computation 

5-5 

4-5 

Arithmetic Reasoning . . . 

. 5.8 

3.8 

Mare and Foal 

4-3 

4-5 


Psychot., ig32, 23. 279-186 ) If the scores from hi* tabic arc grouped 10 decade periods, there 
arc 1.030 for age* under twenty, 88g for the twenties, *24 for the thirties, 71 for the forties, 
and 10 for the fifties The distributions seem to indicate that there were different types of sam- 
ple at decades forty and fifty. Calculation of the mean scores for the three younger age groups 
shows them to be 63 1, 7* a. and 60 1 respectively. The differences in 


e from below 
a the thirties are both significant, with dtfter- 


twenty to the twenties and from the tv - • 

encci divided by the sigmas of the difference* of 7 4 and 5 8 Thus Graces findmgs show „ 
early and significant nse in average test score followed by a dectuie wh.ch is also significant 
and which comes at the twenty-thirty interval. 
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Younger group Older group 


Seguin 2.5 2.0 

Substitution 14.5 23 5 

Pintner Non-Language 51.0 62.3 

Kuhlmann -Immediate Recall 47.8 41.5 

Porteus Maze 13.5 14.3 


The problem of age and test performance will be taken up again in 
Chapter IX, after the correlations with age have been discussed, but in this 
connection it may be noted, first, that differences in mean score on the test 
performances analyzed are comparatively slight between the third and the 
sixth decade. Performances involved in the language tests, with the excep- 
tion of the analogies, are all relatively well maintained with age. In the case 


Table xxiv 

MEAN SCORES FOR AGES TWENTY THROUGH THIRTY-FOUR AND FORTY-FIVE 
THROUGH FIFTY-NINE, AND VALUES FOR THE SIGNTFrCANCE OF 
THE DIFFERENCES BETWEEN THE MEANS 



ACE 20-34 

N Mean 

ACE 45-59 

N b lean 

P VALDES 

School grade reached . . . 

26 

9° 

22 

7.6 

.6-.5 

Barr Occupational Rating . . 

25 

>■4 

=5 

8.7 

.6-.5 

Oral Analogies .... 

16 

22 

18 

>9 

.I-.05 

Printed Analogies . 

24 

24 

22 

18 


Completion Beta .... 

21 

8x> 

22 

75 

• 3--2 

Stanford Binet Vocabulary . 

23 

55 

24 

S 6 

•9-.S 

Thorndike Word Knowledge 

24 

.59 

»9 

68 


Stanford Binet Scale . 

26 

172 

23 

>55 

.1-05 

Thorndike McCall Reading . 

25 

2 5 

21 

24 

.6-5 

Oates Oral Spelling 

22 

27 

19 

25 

•8-7 

Stanford Dictation 


93 

22 

92 

■9-.8 

Arithmetic Computation . 

25 

23 

17 

26 

•4-3 

Arithmetic Reasoning . - 


24 

1 6 


.8-7 



26 

*7 

33 

.1-.05 


l8 

>4 

16 

18 

O-.Ol 

Substitution 


S6 

*4 

IXJ 


Pintner Non-Language 

25 


18 

3<>3 


Kuhlmann Immediate Recall 


167 

>3 


— '* 

Porteus Maze 

*7 

>77 

»9 

>58 
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of the vocabulary tests there are even slight increases in the mean scores of 
the older groups, and on the Thorndike Word Knowledge the mean score 
for the fifties is significantly higher than that for the twenties. The second 
important point to be noted is that the most marked changes with increas- 
ing age are the declines shown on the analogies and on the non-language 
tests. On the Printed Analogies and on the Seguin Formboard the de- 
creases in mean score between the twenties and the fifties are significant. 
When the larger age groups twenty through thirty-four and forty-five 
through fifty-nine are compared, there are again significant differences in 
mean score on these tests and in addition significant decreases in mean 
score on the Substitution and on the Pintner Non-Language Tests. These 
declines cannot in every case be attributed to the more generally recognized 
characteristics of behavior changes with age, for example, to diminishing 
speed or manual dexterity. Neither of these factors enters into the Printed 
Analogies Test, for example. It is possible that older persons have increas- 
ing difficulty with the analogies or the non-language tests because they be- 
come less able to adapt to the particular situations involved; it is also pos- 
sible that the peaks in the development of the performances involved are 
actually reached early in life. In all probability the latter is the better expla- 
nation, for observations of the patients and their ready adaptation to most 
of the test situations in question, as well as the character of some of these 
situations, would seem to rule out the first possibility. 
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THE RELATIONSHIP BETWEEN VARIOUS PERFORMANCE 
' LEVELS IN THE INDIVIDUAL CASE 

j. Method of Analysis 
2. Results 


I. METHOD OF ANALYSIS 

W ITH records for each case on a large and varied battery of tests, the 
material was particularly well suited to a study of the equalities or 
inequalities in performance levels in the individual. For adults studies of 
this sort have never been carried far because of the brief examinations 
usually employed. Yet the problems which may be attacked by such an 
analysis are of great interest. It is important to know how much variation 
in performance level the individual shows on a variety of tests of intelli- 
gence and educational achievement; what the relationship is between this 
variation and sex, age, education, and the grade of the performance as in- 
dicated by test score; and finally what facts the individual case records will 
yield as to characteristic constellations in mental test performance, that is, 
groupings which appear when certain test scores are high in comparison 
to other test scores. 

To make a survey of different test performances it is, of course, necessary 
to convert the various test scores into measures which will be direcdy com- 
parable. Standard measures were computed from the constants o t e s- 
tributions of the total populations . 1 Since the groups were small and the 
distributions only approximately normal, the technique of using stan ar 
measures is not altogether justifiable, but it furnishes a satisfactory indica- 
tion of the trend of the results. Only tests giving fiirly good distributions 
were considered for the analysis, and from among these ten were chosen 
which would cover a number of different types of activity: the Kelley 
Trabue Completion Beta, Stanford Binet Vocabulary, Printed Analogies, 
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Thorndike McCall Reading, Gates Oral Spelling, Letters, Arithmetic 
Computation, Pintncr Non-Language, Kuhlmann Immediate Recall, and 
Porteus Maze Tests. 

Twenty of the women, excluding the 3 who took fewest tests, and 20 of 
the men selected at random were chosen as the first groups. When certain 
marked differences between the men and the women were found in the 
correlations between the index of individual variation and other variables, 
a second group of men was selected, including the 20 of the 27 remaining 
male subjects who had taken the greatest number of tests. The standard 
measures for each of the tests selected, the mean of these measures, 3nd 
their average deviation from the mean standard measure were determined 
for each individual. This average deviation (A.D.) was taken as a measure 
of the variations in performance levels in the individual case, and was cor- 
related with mean standard measure, age, and education. When the statisti- 
cal analyses had been completed, the individual case records were plotted 
on graphs showing the relative position of the standard measures for the 
ten tests analyzed, and these graphs were then compared. 


2. RESULTS 


The mean standard measures and the means and sigmas of the average 
deviations for the three groups are as follows: 


Females . . 
Males, Group A 
Males, Group B 


Mean standard Mean Sigma 

measure AD. AD. 

• —.IQ .49 .17 

.00 .44 .17 

• +-I3 -5° -24 


There is a close correspondence between the three groups in the means of 
the average deviations, but the women are relatively low in mean standard 
measure and the men of the second group relatively high. The men of this 
Group B are also slightly more variable than the subjects of the other 
groups. 

The correlations for each group and for the two groups of men combined 
are given in Table XXV. For such small numbers the standard errors are, 
of course, high. 

According to Fisher’s criterion of the significance of a correlation, only 
the correlation of —.53 between A.D. and education in Group A, which 
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has a P value of .02, can be taken to indicate a definite relationship. 2 It 
seems probable that the higher correlations throughout this group depend 
on some factor in its selection. The findings as a whole indicate that there is 
little, if any, relationship between inequality in test performance as meas- 
ured by average deviation and general level of test performance, age, or 
education. 

Studies of the individual case records are interesting. Thirty-two of the 
60 cases analyzed (23 of the men and 9 of the women) make standard 
scores on all tests within plus or minus one sigma of their mean standard 
measures. In other words, slightly more than half the group and an almost 
equivalent proportion of men and women show no great individual varia- 
tion in the ten test performances. It must be noted that these include not 


TABLE XXV 

CORRELATIONS OF THE AVERAGE DEVIATION OF THE STANDARD MEASURES 
WITH OTHER VARLABLES 



FEMALES 

Group A 

MALES 
Group B 

Groups A and B 

Mean standard measure 

— .OOI-+-J3 

-•49±-i7 

—.04 ±^3 

—.2I^:.I5 

A g' 

-f-.22rfc.2i 

+•34— - 20 

-h.r2rfc.22 

-f-.l8rfc.25 

School grade reached . 

— .OO4 ±.23 

— 53 ±.i 6 

— .orrfc.23 

— .I7lfc.l5 


Note: Each correlation i$ followed by its standard error. 


only tests manifesting a fairly high degree of relationship, as reported in 
Chapter VIII, but also tests such as the Porteus Maze and the Kuhlmann 
Immediate Recall which are much less closely related to the majority of the 
verbal tests. Comparison of these cases with those giving one or more test 
performances which fall more than one sigma from the mean standard 
measures shows slight differences in age and education; as one would ex- 
pect, the subjects with the smaller variations are on the average younger 
(three years m the case of the men and two in the case of the women) and 
slightly higher in educational level (half a year for the men and a year for 
the women) than the subjects with one or more performances deviating be- 
yond one sigma from their respective mean standard measures. The situa- 


* Op. eii , Table V, A 
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tion has been suggested by the correlations with average deviation reported 
in Table XXV, but only one of these was found to be significant. 

Among the 28 cases whose performances do not fall within plus or minus 
one sigma of the mean standard measure, n deviate more widely than this 
in only one performance. These single and relatively large deviations ap- 
pear more often for the Porteus Maze than for any other test (4 cases). 

Inspection of the graphs showing the mean standard measures for each 
case reveals no characteristic constellations among the test performances 
analyzed. There are a few cases showing relatively high or relatively low 
non-language performances, as indicated by the three non-language tests, 
but for the group at large there arc no clear-cut subdivisions in the pattern 
of the variations. Nor is there any indication of generally high or generally 
low performances on the so-called educational achievement tests in cases in 
which the educational level is considerably above or considerably below the 
average* 

The absence of large deviations in the majority of cases is a demonstra- 
tion of the fact that adults who do well on some of the tests in question 
usually do well on the others, and conversely that those who have difficulty 
with some tests are likely to have difficulty with others. The further evi- 
dence from the analysis of individual case records shows that particular 
constellations of abilities, as indicated, for example, by high scores on non- 
language tests with low scores on language, arc the exception rather than 
the rule; for the most part, the adult subjects show relatively small devia- 
tions from their respective mean standard measures, and these have an ir- 
regular character, or at least one which cannot be ordered according to a 
logical analysis of the nature of the tests. 

* The relative performance levels on educational achievement and intelligence rests will be 
analyzed in the section on Education and Test Performance in Chapter IX, where a number 
of graphs arc presented showing various performance levels in individual cases. 
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CORRELATIONAL FINDINGS 


1. Correlations with Age 

2. Correlations with Education 

3. Correlations with Occupation 

a. Chi-Square determinations, Taussig ratings 

b. Correlations with Barr Scale ratings 

4. Intercorrclations between the Most Important Tests 

a. Correlations with the Stanford Binet Scale 

b. Correlations with the Kelley Trabue Completion Exercise Beta 

c. Correlations with the Vocabulary Tests 

d. Correlations with the Thorndike McCall Reading Scale 

e. Correlations with the Gates Oral Spelling and the Stanford 

Achievement Dictation Tests 

f. Correlations with the Printed Analogies Test 

g. Correlations with the Stanford Achievement Arithmetic Tests 

h. Correlations with the Pintner Non-Language Test 

i. Correlations with the Goodcnough Drawing Test 

j. Correlations with the Porteus Maze Test 

I. CORRELATIONS WITH AGE 

TN general the correlations between age and non-verbal tests arc negative 
X and of relatively high degree, ranging up to —.5 3, while those between 
age and verbal tests are of relatively low degree, but still negative. As de- 
termined by Fisher’s criterion, the only reliable positive correlation between 
age and a verbal test appeared for the Thorndike Word Knowledge. 1 Re- 
liable negative correlations appeared for the Oral and Printed Analogies, 
the time score on the Mare and Foal, the Segrnn, the Substitution, the Pint- 
ner Non-Language and the Porteus Maze Tests. Correlations between age 
and thirteen of die more important tests of the battery are presented in 
Table XXVI. 

Apparendy the wide range of the group had little effect on the degree of 

1 Since the numbers were in each case comparatively small, the standard errors were high. 
Each correlation was tested for significance by Fisher's formula (op. eil.. Table V, A) and was 
taken to indicate the presence of an actual relationship between the two variable* only if it tad 
a P value of M2 or less for a group of the jittn tire. 
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Correlations between education and test score were calculated for nine- 
teen of the tests of the battery. In all cases the correlations are positive and 
in all but two cases they are reliable; the two exceptions are those for the 
Kuhlmann Immediate Recall and the Goodenough Drawing Tests. Thir- 
teen of these correlations are given in the second column of Table XXVI. 


TABLE XXVII 


PARTIAL CORRELATION'S INDICATING THE RELATIVE IMPORTANCE OF THE 
FACTORS OF SCHOOL GRADE REACHED AND TEST INTELLIGENCE IN 
VOCABULARY AND EDUCATIONAL ACHIEVEMENT TEST SCORES 



EDUCATION 

STANFORD BINET 


First 

C emit A 

Firrt 

$eeonA 

TEST 

crier 

order* 

order 

order* 

Thorndike Word Knowledge 

.46i.II 

.I2i-I4 

•6a±.09 

^Si.ia 

Gray Oral Reading .... 

.36^.13 

.i8i.i4 

-39i.i3 

jai.14 

ihomdike McCall Reading . 

.63 i .oS 

.38^.12 

.75^.06 

.6ad:.09 

Chapman Reading-Comprehension 

-63i.ro 

•3 6 ±->4 

.Si i.06 

•jfii.oS 

Gates Oral Spelling .... 

■ 59~-°9 

-20±.I3 

•6Si.oS 

•57*-m 

Stanford Dictation 

.60-*-. 09 

-23*13 

.74^.06 

.57i.ro 

Arithmetic Computation . . . 

.67 -*-.oS 

-49*-" 

i>7i-oS 

44*-*2 

Arithmetic Reasoning .... 

-59— '°9 

-=6i-I3 

.8 1 ±.05 

■-*•07 


Note: Each correlation is followed by its standard error. 


• Stanford Binet out. f Education out. 

The relatively high correlations between scores on the tests and school 
grade reached do not necessarily indicate that success in the tests is a func- 
tion of education, for education may as well be considered the dependent 
as the independent variable and some of the variations in test score with 
education may depend on the relationship existing between intelligence and 
the amount of education. If the Stanford Binet score be considered an in- 
dex of intelligence, it is possible to determine by the partial correlation 
technique the degree of relationship between education and test score when 
the effect upon these of the variation in intelligence is discounted. It seemed 
worth while to calculate these partial correlations for the Thorndike Test 
of Word Knowledge and for the various educational achievement tests, suc- 
cess In all of which might be expected to depend largely on the amount o 
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Thorndike McCall Reading . . . 
Chapman Reading-Comprehension 

Gates Oral Spelling 

Stanford Dictation 

Arithmetic Computation . . . 

Arithmetic Reasoning . . . . 


21.2 per cent 

30.4 per cent 
xi.o percent 
17.6 per cent 
10.1 per cent 

30.5 per cent 


A comparison of these figures with those £iven above for the variation 
In test score with education shows that for every test except the Arith- 
metic Computation, the percentage of variation with the Stanford Binet 
score is higher than that with the school grade completed. For most of the 
tests, including the Thorndike Test of Word Knowledge, the Thorndike 
and the Chapman reading comprehension tests, the Stanford Dictation, 
and the Arithmetic Reasoning, the differences are marked. 


3 . CORRELATIONS WITH OCCUPATION 


a. Chi-Square determinations, Taussig ratings 
The degree of association between location in the Taussig classification 
and score on some of the tests was studied by means of the Chi-Square 
technique. 6 Group I, Professional, and Group II, Semi-professional and 
Business, were combined for this purpose, as were Groups IV and V, Semi- 
skilled and Unskilled labor. Scores on the tests were thrown into two 
classifications, above and below.thc mean. The tests used and the P values 
for the Chi-Square of each are given below; 


Taussig — Age 

Taussig — Education 

Taussig — Stanford Binet . . - • 

Taussig — Completion Beta . . - 

Taussig — Stanford Binet Vocabulary 
Taussig — Thorndike McCall Reading 
Taussig — Gates Spelling . 

Taussig — Printed Analogies . * • 

Taussig — Arithmetic Computau'on . 

Taussig — Scguin 

Taussig — Substitution . . . • 
Taussig — Pintncr Non-Language . 
Taussig — Porteus Maze . . • 


N 

Cht-Square 

Pialuet 

5 2 

715 

-03 

47 

3.85 

-15 

48 

7.66 

.05-.02 

43 

16.74 

O-.OI 

52 

IO.49 

O-.OI 

44 

H.38 

o-.or 

3S 

6-33 

.10-.05 

44 

3.22 

•20 

44 

7-4® 

.IC-.O? 

3 6 

2.76 

•3- J 

36 

128 

•7-5 

4* 

.70 

.9 -.8 

33 

.78 

•7-5 


• H_n ». tUaifcd ... uf ° r *• Ti "” i S 11 ,hi ‘ 

analysis are mostly men. 
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The results indicate a division between the verbal and the non-verbal 
tests; the Printed Analogies falls with the non-verbal group. There is a 
significant association between Taussig classification and scores on the 
Completion Beta, the Thorndike McCall Reading, and the Stanford Binet 
Vocabulary Tests, and probably a significant relation between the Taussig 
and the Stanford Binet Scale as a whole. Among the verbal tests, only the 
Gates Spelling and the Printed Analogies fail to show a real association 
with the Taussig classification. Of the non-verbal tests, not one is associated 
with this classification of occupations. 

The presence or absence of association is strikingly evident from the con- 
tingency tables, three of which arc reproduced in Table XXVIII. 


TABLE XXVIII 

CONTINGENCY TABLES FOR TAUSSIG RATINGS AND VARIOUS TEST SCORES 


landtt 

TAUSSIG CTOUPS 

III IV and V 

Total 

Stanford Binet scores below 160 . . . 

3 

6 

*3 

22 

Stanford Binet scores above 160 . . . 

13 

6 

7 

26 

Total 

16 

12 

20 

4 3 

Completion Beta scores below 7.80 . . 

1 

5 

16 

22 

Completion Beta scores above 7.80 . . 

12 

5 

4 

21 

Total . 

*3 

10 

20 

43 

Pintner Non-Language scores below 350 

6 

5 

xo 

21 

Pintner Non-Language scores above 350 

7 

6 

7 

20 

Total 

*3 

it 

n 

4 1 


b. Correlations with Barr Scale ratings 
The correlation between Barr ratings and age is +.16 ±.13, and that be- 
tween Barr ratings and education -{-.48 ±ai. The latter is reliable. 

Correlations with Barr ratings were computed for one or more of the 
tests in each group, vocabulary, reading, and so on. They are all positive 
and range from .07 ±.14 with the Completion Beta to JSx ±jog with the 
Stanford Arithmetic Reasoning. It is interesting to find that the neat to the 
lowest of those correlations calculated is +.12 ±.16 with the Porteus Maze 
Test. Thirteen of the correlations with Barr Scale ratings are presented in 
the third column of Table XXVI. 





83 CORRELATIONAL FINDINGS 

4. INTER CORRELATIONS BETWEEN THE MOST IMPORTANT TESTS 

Of the 666 intcrcorrelations possible for the 37 tests reported in this mono- 
graph, 172 were calculated. In addition to the intcrcorrelations of the most 
important tests of the batter)', this number included correlations calculated 
for some particular purpose, for example, to find the degree of correspond- 
ence between oral and printed forms of a test, or to determine the interre- 
lationship of sub-tests in a scale. 0 For economy in presentation only the in- 
tercorrelations of the most important tests will be reported. These are 
shown in Table XXIX. The first eight tests would generally be considered 
predominantly verbal, and their intercorrebtions are relatively high. It is to 
be noted, however, that the correlations with the Printed Analogies are 
considerably lower than those with the other tests of this group. They are 
lower than the correlations between the language and the arithmetic tests, 
though higher than the correlations between the language and the non- 
language tests. 


a. Correlations with the Stanford Binet Scale 

Reference to Table XXIX will show that the Stanford Binet, the com- 
posite test of socalled general intelligence, docs indeed show the highest 
inter-test correlations, on the average, of any test reported. 7 The correla- 
tions with the Printed Analogies and the two arithmetic tests are about as 
high as those with tests of the language block. The scatter diagram for the 
correlation between the Stanford Binet and the Printed Analogies Tests, 
which is presented in Figure r, shows the relationship to be fairly close; 
most of the scores cluster near the regression line. The same clustering is 
apparent in the diagram for the Stanford Binet correlation with the Arith- 
metic Computation, and to an even greater extent in that for the correlation 
with the Arithmetic Reasoning Test. The correlation between the Stanford 
Binet and the Pmtner Non-Language Test, -{-.69 is high in compari- 
son to the findings of other authors on children. Morganthau reported a 

* A few correlations with the Cancellation Test were calculated in order to determine^ what 
relationships existed between this comparatively simple test and the more complex tntcHijjcBW 
tests. The correlation with the Stanford Binet Scale was 4-J3 —.14 for 46 cases, with tbe ^ 
Completion Beta 4- .40^.13 for 43 cases, and with the Plainer Non-Language Test — " 

* Th” higheM^correlaaoa with the Stanford Binet is that of the Chapman Reading-Compre- 
hension Test: -J-.Sj ins for 4a cases. 
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correlation of +.44 :£.o6 for 97 children, aged ten to sixteen. 6 * 8 The differ- 
ence might result from the wider range in the adult group, but this explana- 
tion seems unlikely in view of the fact that the Stanford Binet shows a low 
negative relation to age and the Pintner Non-Language Test a fairly high 



Figure r. Correlation between the Stanford Binet 
and the Printed Analogies Tests: 
r = 4-72; b = -{-290 


6 D. Morganthau, Some Well-Known Menial Tests Evaluated and Compared, Arch. 
Psychol , 1922, 7, No. 52, 38 

Walters found an even lower correlation, + »3. for * group of 165 children in grades VI B 
and VII A. The Stanford Binet correlations with the sub-tests of the Pintner Non Language 
were —.04 with Imitation, —n with Hard Learning, +23 with Drawing Completion. 

+.32 with Reversed Drawing, and +.26 with Picture Reconstruction It » to be noted, how- 
ever, that while nearly all of these children were born in the United States, more than two- 
thirds of them came from families of foreign extraction, so that there may have been in- 
equalities in facility with the English language (F C Walters, A Statistical Study of Certain 
Atpects of the Time Factor tn Intelligence. Teach Coll. Contri. to Edue, 1927, No. 248, 
34-350 
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negative relation. It is impossible to check by partialing because of the 

curvilinear relationship between age and Stanford Binet score. 

The correlation between the Stanford Binet and the Goodenough Test of 
Drawing a Man, on the contrary, is much lower than that usually found for 



Figure a. Correlation between the Stanford Binet and the Goodenough 
Drawing Tests: r = +.39; b = +.87 


children. It is 17 to 46 points lower than the correlations reported by Good- 
enough for children aged four to ten.® The scatter diagram, reproduced in 
Figure 2, shows how slight the relationship between the two test perform- 
ances is at the adult level. 

The correlation between the Stanford Binet and the Porteus Maze Test, 
-{-.59 rfc.io, is about the same or slightly lower than the correlation re- 
ported by other authors for children. Morganthau found a correlation of 

» F. L. Goodenough, Measurement of Intelligence by Drawings, 31. 
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+•54 ±.05 for her no boys and girls; 10 Gaw a correlation with Stanford 
Binet I.Q. of +.52 for 52 boys and one of +. 29 for 48 girls; 11 and Worth- 
ington a correlation of +.75. 12 

The scatter diagram for the 43 adults shows a rather extensive spread 
around the regression line, with some fanning above a score of about 144 
on each test. The cases are too few to make a definite conclusion possible, 
but the tendency is in line with the statements Porteus has made about the 
closer correspondence of Stanford Binet and Porteus scores in the case of 
low-grade than of high-grade subjects. 13 

In view of the fact that some of the tests in this research were more satisfac- 
tory than the Stanford Binet for the adult group from the point of view of their 
acceptability to the subject and their discriminative value, several batteries re- 
quiring about as much time as the Binet were set up and multiple correlations 
calculated. The groups are, of course, too small for the results to be more than 
suggestive. One battery contained three tests which were each satisfactory indi- 
vidually: the Printed Analogies (X,), the Completion Beta (X,), and the Thorn- 
dike Word Knowledge (X 4 ). The findings for the battery were: 

231 = +.860 

X t = 1.32X3 + 7.83X, + .33X4 + 57.14 

°i iu—K '73 

The amount of variation in Stanford Binet score resulting from variation in 
the other three tests was 49 per cent. 

Since a dose relationship with the subject is valuable in many cases, either 
because it affords an opportunity for study of his reactions or because it ensures 
better cooperation, an oral battery was tried. The tests induded were the Stan- 
ford Binet Vocabulary (X s ), the Oral Analogies (X,), and the Oral Absurdi- 
ties (X 4 ). The findings were as follows: 

R« 234 = +.<>14 

X, = r.o8X s + .94X, - 4 * 3.18X4 + 34 

o, j S4 = 12.38 

Sixty-two per cent of the variation in Stanford Binet score resulted from varia- 
tion in the three tests of the battery. 

As a criterion against which to check other tests or batteries of tests, the Stan- 
ford Bind is for from perfect. The difficulties axe particularly serious ia work 

10 Op. at , 38. 

11 F. Caw, A Study of Performance Tettr, Brit. J. Psycho! , Cen. See., 1924-25, *5. 3SJ- 
38S. 

1* M. JL Worthington. A Study of Some Commonly Vied Performance Ttttt, I. Appf. 
Psychol., 1926, 10. 231 

I* S. D. Porteus, Vtfie Tern and Mental Differences. 84. 
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with adults. It is probable that some of the sub-tests show improvement with 
increasing age after maturity while others show little change and still others de- 
cline. If so, the same total score would not have the 'same significance at differ- 
ent ages. This difficulty is, of course, present in any composite score for a battery 
of tests such as those discussed above, and for this reason, whenever the battery 
is not homogeneous, it is undoubtedly better to report the test results separately 
than to employ a composite score. 



Figure 3. Correlation between the Kelley Trabue Completion Beta 
and the Pintner Non-Language Tests: r — +.56; b = +•' °°7 


With results for the three tests reported separately, it is probable that the rst 
battery would make a satisfactory brief examination of intelligence. The Com 
plctton Beta is the most valuable language test of the battery as a whole: it inter- 
ests adult subjects and it has more discriminative power at the upper levels an 
any other. The Thorndike Test of Word Knowledge is discriminative for 
adults from the middle levels of the population and, while it probably me u cs 
a certain number of doubtful items, it is on the whole a good test. In relation o 
these two, both of which demand activities generally considered as excellent 
indices of verbal intelligence, the Printed Analogies is more or less an un not 
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quantity. It probably introduces factors not present in the other test performances, 
but the nature of these factors cannot as yet be defined. 

For old or uncooperative subjects or any others requiring individual work, 
the oral battery would probably be useful. It would undoubtedly be improved 
greatly, however, by an extended Oral Absurdities Tesr. All three tests of this 
battery arc easily administered and well received, and in cases showing resist- 
ance to an intelligence test the battery may be introduced as a language test. 

b. Correlations with the Kelley Trabtte Completion Exercise Beta 

The correlations between the Completion Beta and the language tests, 
including the Printed Analogies, are fairly high. 14 Those with the arith- 
metic tests are also fairly high. The relationships shown by the scatter dia- 
grams are consistent throughout the ranges. All nine of these correlations 
arc reliable. 

The correlation of 4**56 et.io with the Pintner Non-Language Test is 
also reliable. 18 The scatter diagram, which is shown in Figure 3, brings out 
an interesting situation: the relationship is more marked when the Com- 
pletion Beta score is over 6.50; lower Beta scores may be associated with 
high scores on the Non-Language Test. 

In view of the relatively high correlation which both the Completion 
Beta and the Porteus Maze show with the Stanford Binet, and also with the 
Printed Analogies Test, the Beta-Porteus correlation of only 4 *.io — 16 is 
of considerable interest. It is not easily interpreted. The diagram, pre- 
sented in Figure 4, shows a diffuse rectilinear scatter, making the lack of 
relationship between the two variables quite apparent. 


c. Correlations with the Vocabulary Tests 

The first six correlations with each of the vocabulary tests arc reliable 
and fairly high. 16 The correlation of 4 -.S 8 ±.03 between the Stanford Binet 
Vocabulary and the Thorndike Test of Word Knowledge is the second 
highest in the table. 

The correlation between the Stanford Binet Vocabulary and the Printed 

14 The highest of the correlations calculated for the Completion Beta was that with the 
Chapman Readtng-Com prehension Test- 4 - .85 ± 05 for 38 cases 

14 The correlation between the Completion Beta and the Drawing Completion Test of the 
Pintner Non-Language is + 47 — 12 4 ® cases 

19 The highest correlation with the combined time and error scores on the Gray Oral 
Reading Test. Set IV. was that of the Thorndike Word Knowledge. + 69 ±.08 for 44 cases. 
The correlation between the Cray Oral Reading and the Stanford Binet Vocabulary was lower: 
+.57 S:.ic» for 49 cuts 




Figure 4. Correlation between the Kelley Trabuc Com- 
pletion Beta and the Porteus Maze Tests: 
r = +.10; b =: +.004 



Figure 5. Correlation between the Thorndike McCall Reading 
Scale and the Arithmetic Computation Test: 
r = +.66; b = +.43 
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Analogies Test is moderately high, but the diagram shows a wide scatter 
from the regression line. The diagram for the surprisingly low correlation 
of +.31 ±.13 between the Thorndike Word Knowledge and this Analogies 
Test reveals a curious scatter, with a fairly close clustering of the scores 
around the regression line below a score of 50 on the Word Knowledge 
Test, and an increasing fanning of the scores above that (see Figure 6). 

Correlations with the arithmetic tests are lower than with the first six 
language tests of the table, but they are reliable. The diagrams show 
fairly wide general scatters from the regression line, except in the case of 
that for the correlation of the Word Knowledge and Arithmetic Compu- 
tation Tests where the relationship is relatively close at the lower end of the 
scales. 

The diagrams for the remaining correlations with the vocabulary tests 
show no peculiarities. 

d. Correlations with the Thorndike McCall Reading Scale 

All twelve of the correlations calculated between the Thorndike McCall 
Scale and other language tests are reliable, the highest being the correlation 
of -{-.83 ±.04 with the Completion Beta. 17 The scatter diagrams for the 
correlations of -f*.66 with each of the arithmetic tests are similar. That for 
the Arithmetic Computation is reproduced in Figure 5 to show how closely 
most of the cases cluster around the regression line; yet paragraph reading 
and arithmetic computation would hardly be said, on superficial analysis at 
least, to have a great deal in common. 18 

In the case of the other tests correlated with the Thorndike McCall the 
scatters are fairly wide and without pattern. 

e. Correlations with the Gates Oral Spelling and the Stanford 
Achievement Dictation Tests 

The correlation of +.90 ±.03 between the Gates Oral Spelling and the 
Stanford Dictation is the highest inter-test correlation found. In general, 
the scries of correlations with the two tests are similar, but in the case of 
the language tests, with the exception of the analogies, and in the case of 

A similar statement holds Cot the corresponding correlations (or the Chapman Reading* 
Comprehension Test, but the correlations of this senes were in general a lew points higher 
than those with the ThorndAe McCall. 

‘•The correlation between the Thor ad Ac McCall and the Arithmetic Computation Tests 
« not materially changed when age is partial ed out (+ 67 S-oS); it is loner when educa- 
tion Is partialed out (+.<6 2: is). 
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the arithmetic tests, the relationships to the Stanford Dictation are closer 
than those to the Oral Spelling. 

The scatter diagrams show nothing unusual, except in the case of that 
for the correlation between the Gates Spelling and the Pintner Non-Lan- 
guage, where the relationship is closer above a score of 20 on the former. 



Figure 6. Correlation between the Primed Analogies Test and the 
Thorndike Test of Word Knowledge: r = -f-31; b = -f-.ro 


f. Correlations with the Printed Analogies Test 
The highest correlations found for the Printed Analogies are with the 
Stanford Binet Scale and the two arithmetic tests. 19 

There is a large difference between the degree of relationship shown for 
the Printed Analogies and the Stanford Binet Vocabulary and that shown 
for the Printed Analogies and the Thorndike Test of Word Knowledge, 
the former being ax points higher than the latter. The scatter about the re- 

The highest correlations found lor the Ora! Analogic! arc with the Completion Beta, the 
Oral Absurdities, and the Stanford Binet Scale, but on the whole the two analogies tests, 
which correlate to the extent of +.83 i.05, differ comparatively little in their relationships 
to other tests. The pairs of scatter diagrams, as well as the pairs of correlation coefficients, arc 

similar. 






Figure 8. Correlation between the Printed Analogies and the 
Pintner Non-Language Tests: r = +.55; b = +.03 


CORRELATIONAL FINDINGS 


98 

gression lines is wide in both instances, but it is narrower at the lower 
levels in the diagram for the correlation between the Printed Analogies and 
Word Knowledge Tests (Figure 6). 

It is interesting to find that the Printed Analogies Test correlates fairly 
highly with the two arithmetic tests. The scatter diagrams both show closer 



Figure 9. Correlation between the Printed Analogies and 
the Porteus Maze Tests: r = -{—62; b = +.14 


relationship above the means; that for the Printed Analogies and the 
Arithmetic Computation is reproduced in Figure 7. 

The correlation of -K55 at .10 with the Pintner Non-Language Test is 
higher than most of the other correlations with this test. The diagram, pre- 
sented in Figure 8, shows an unusual triangular distribution of scores wi 
the apex in the lower left corner. 

The correlation between the Printed Analogies and the Porteus Maze ts 
one of the most interesting in the table. With the exception of one 
slightly aberrant case, the scores duster fairly closdy around the regres- 
sion line throughout the entire range, and the diagram, which is repro- 
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duced in Figure 9, resembles that for Primed Analogies and the Completion 
Beta. The correlation is higher than those with most of the language tests; 
and this fact together with the high correlations with the arithmetic tests 
and the Pintncr Non-Language certainly suggests the presence of a non- 
verbal factor in this test. 


TABLE XXX 

CORRELATIONS WITH AGE AND INTER-TEST CORRELATIONS OF THE SCORES 
OF SUB-TESTS OF THE PINTNER NON-LANGUAGE (56 CASES) 


DRAWING PICTURE 

HARD COMPLE- REVERSED RECON- 
IMITATION LEARNING TION DRAWING STRUCTION 


OomUtioni with age . 
INTER-TEST CORRELATIONS 

Imitation .... 
Hard Learning . . 

Drawing Completion . 
Reversed Drawing 


— -5a±.io — .461t.11 — .4atr.11 — >5o±.io — .aS±.i3 


-f.67i.07 -t-.69i.07 -f.66i.o8 -f.36i.09 
-f-55* 09 -f.5ai.10 -f.39i.1a 
+.57i.09 -f.60i.09 
-f.47rt.10 


INTER-TIST PARTIAL CORRELATIONS (aCE OUT) 

Imitation .... -f.53i.10 -f.6oiog -f. 5 8io9 -f.5ai.10 

Hard Learning . . -f.54i.09 -f.45i.11 +.38i.ia 

Drawing Completion . -f-53i-io -f.57i.ro 

Reversed Drawing . -f e8i it 


Note; Each correlation is followed by its standard error. 


g. Correlations with the Stanford Achievement Arithmetic Tests 
Inspection of the columns of correlation coefficients for the two arith- 
metic tests in Table XXIX shows that the Arithmetic Computation bears 
about the same degree of relationship to most of the other tests of the 
series, verbal as well as non-verbal, as the Arithmetic Reasoning. The only 
two exceptions arc the correlations with the Stanford Binet and Pintner 
Non-Language Tests. With the Stanford Binet the Arithmetic Reasoning 
gives a higher correlation than the Computation, and with the Pintner Non- 
Language Test the situation is reversed. The scatter diagrams for the corre- 
lations between the two arithmetic tests and the Stanford Binet show simi- 
lar distributions of the scores, but a closer clustering around the regression 
line in the case of the correlation with Arithmetic Reasoning. The scatter 
diagram for the correlation of +-65 ±.08 between the Arithmetic Com- 



IOO 


CORRELATIONAL FINDINGS 


putauon and the Pintner Non-Language Tests shows a wide spread around 
the regression line and it is evident that the relationship is not close. 

h. Correlations with the Pintner Non-Language Test 

In general the correlations with the Pintner Non-Language Test assume 
an intermediate position; the)' are, on one hand, lower than most of the 
correlations with language tests, including the analogies, or with the 
arithmetic tests; and, on the other hand, higher than the correlations with 
the remaining non-verbal tests. All but one of the correlations with the 
Pintner Non-Language Test are significant, the exception being that for 
the Thorndike Word Knowledge. 

The two highest correlations in the group are with the Stanford Binet 
and Arithmetic Computation Tests; the next highest with the Porteus 
Maze,” the Completion Beta, and the Printed Analogies. The remaining 
correlations indicate only slight positive relationships between the vari- 
ables; the scatter diagrams are all more or less diffuse. 

Intcrcorrelations of the Pintner Non-Language sub-tests, with the omis- 
sion of Sub-test «, the Easy Learning, may be read from Tabic XXX. 
They range from -f .69 for Imitation and Drawing Completion to +-39 ** or 
Hard Learning and Picture Reconstruction. These correlations are much 
higher than those reported by Morganthau on children of ten to sixteen 
years. 22 When those for Sub-test 2 are omitted, the correlations obtained 
by this author range from for Reversed Drawing and Drawing Com- 
pletion to -}-^6 for Reversed Drawing and Picture Reconstruction. « 
seemed possible that these correlations resulted in part from the larger age 
range among the adults. With age partialed out, howeser, the second order 
correlations, which are shown in the final section of Table XXX, are still 
higher than those reported by Morganthau. 

i. Correlations with the Goodcnottgh Drawing Test 

The correlations with the Goodenough Drawing Test are almost all low. 
Only three were of a degree higher than might hasr beat obtained by 

*° Morgantiua found a correlation of +.36 between the Pintner Non-Language and the 
Porteus Marc. (Op. at., ^5.) _ _ ___ 

** The total score on the Pintner Non-Language Test, whenever given, includes the 
on all sub-tests. Sub-test 2 was not studied in relation to the other tests, however, because 
gave a skewed distribution which showed that it was inadequate at the adult level. 

«Op nr, 33. 
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chance; those with the Pintner Non-L3nguage, the Stanford Binet, and 
the Thorndike McCall Reading Scale. 2 * The diagrams are all similar, with 
a diffuse scatter around the regression line throughout the entire range. 

j. Correlations with the Portent Maze Test 
From the point of view of the correlational analysis, the Porteus Maze 
Test was one of the most interesting. The relationships indicated by the 
coefficients of correlation are puzzling. Four of these coefficients are re- 
liable: the highest is +.62 rfc.io with the Printed Analogies, and the two 
closely following are 4**59 with the Stanford Binet and +.58 de.ir 
with the Pintner Non-Language Test; the fourth reliable correlation is that 
of -41 ±.14 with the Gates Oral Spelling Test. The very low correlation 
with the Completion Beta has already been noted (see Figure 4) . There is a 
slight but unreliable negative correlation with the Thorndike Test of Word 
Knowledge. 

**The correlation of -f.47 ± i» between the GooJenough Drawing and the Drawing 
Completion T«t of the Pintner Non-Language i* al» reliable. 



IX 

SURVEY OF THE FINDINGS AND THEIR SIGNIFICANCE 


1. Adult Performance Levels 

2. Comparative Findings for Men and Women 

3. Age and Test Performance 

4. Relationships between Various Performance Levels in the Individual 

Case 

5. Education and Test Performance 

6. Occupation and Test Performance 

7. Relationships between Various Test Performances 

I. ADULT PERFORMANCE LEVELS 

T HE scores reported in Table XI are too few in number to establish 
standards for the various test performances, but they are probably 
good indications of the standards for the middle levels of the adult urban 
population. The facts which justify this statement arc the characteristics of 
the sample in relation to data from the 1930 census report and from the re- 
sults of the Army testing, and also the consistency shown in the findings 
themselves. As reported in Chapter II, there is a close correspondence be- 
tween the Philadelphia population and the test group with regard to the 
proportions in different types of occupation. On the question of the educa- 
tional and occupational levels of the test group, and indirectly of its intel- 
lectual status, the Army findings offer more direct evidence. The mean 
school grade reached by the males of the test group is 8.0 and that reached 
by the white draft in the Army 6.9. It has been noted that this difference 
probably reflects changing conditions in education rather than a definite 
superiority in the test group. In terms of occupational rating on the Barr 
Scale there is a very close correspondence: the mean for the males of the 
test group is 8.0 and that for the group of occupations for which the median 
Army Alpha scores was nearest the mean score for the total draft is 8.1. 

Interesting evidence of the consistency of the results which has not yet 
been presented is to be found in comparisons of the mean scores for the 
three separate hospital groups. It is known that these differ slightly in the 
educational and occupational status of their ward patients and that a rank- 
ing on either of these criteria would place the Orthopedic Hospital first, 
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the Graduate second, and the Philadelphia General third. Consequently, it 
would be expected that, if the samples at the separate hospitals were ade- 
quate to give trustworthy results, the mean test scores would indicate the 
same ranking. Analysis shows that they do; mean scores for all but three 


TABLE XXXI 

MEAN SCORES ON THE MOST IMPORTANT TESTS TOR PATIENTS 
DISTRIBUTED BY HOSPITAL SOURCE 



PHILADELPHIA 

CZNIML 

HOSPITAL 

N Score 

GRADUATE 

HOSPITAL 

N Score 

ORTHOPEDIC 

HOSPITAL 

N Score 

Stanford Binet Scale - 

20 

> 57-8 

l 3 

154-5 

32 

171.0 

Completion Beta ... 

l6 

7-2 

*3 

7-5 

23 

8.3 

Stanford Binet Vocabulary . . 

22 

52.2 

15 

5>-5 

32 

57-1 

Thorndike Word Knowledge . 

*4 

56-4 

12 

57-4 

23 

66.3 

Thorndike McCall Reading . . 

*7 

23.8 

12 

248 

30 

23.1 

Chapman Reading-Comprehension 

>5 

153 

11 

15.6 

:6 

18.0 

Gray Oral Reading 

12 

S0.3 

n 

78.1 

26 

68.9 

Gates Oral Spelling . . 

*5 

22.1 

13 

25.1 

25 

29.0 

Stanford Dictation .... 

18 

82-} 

M 

83.3 

26 

99.6 

Oral Analogies . . ... 

18 

18.7 

13 

23.0 

to 

22.0 

Printed Analogies ... 

30 

19-4 

*3 

20.1 

27 

2S 7 

Arithmetic Computation . 

16 

23.6 

n 

= 6-4 

23 

=;-5 

Arithmetic Reasoning .... 

>5 

215 

ri 

= 2-4 

28 

237 

Pintnci Non-Language 

18 

327.I 

11 

3684 

27 

371.1 

Kuhlmann Immediate Recall . 

8 

201.9 

11 

172.9 

25 

163.8 

Goodenough Drawing 

16 

25.I 

1- 

25.2 

25 


Porteus Maze 

iS 

1623 

12 

1730 

>5 

* 7 - 4-4 


tests are highest at the Orthopedic, second highest at the Graduate, and 
lowest at the Philadelphia General. As shown in Table XXXI, two of these 
three exceptions are for the Stanford Binet Scale and the Stanford Binct 
Vocabulary, on each of which the Philadelphia General group slightly ex- 
cels that at the Graduate, and the third is for the Oral Analogies, on which 
the groups at the Graduate and Orthopedic make the same mean scores. 
The remarkable consistency which this analysis res cals would not be sur- 
prising if the three groups were (ar apart, but they are not; the mean 



ADULT PERFORMANCE LEVELS 


105 

ten years. The Completion Beta is the only test showing an average score 
much above fourteen, but there are a number showing average scores which 
fall below thirteen, notably the Pintner Non-Language (12-6), the Kuhl- 
mann Immediate Recall (between 12 and 13), the Substitution (12-0), the 
Arithmetic Computation (ii-ii), and the Seguin (n-o). Incidentally it is 
interesting to note that one of the tests in this low- mental-age group is the 
arithmetic computation, and the other tests all non-verbal. In short, there is 
definite evidence that scores for adults on a test which includes different 
types of performance or composite scores on a variety of tests can never be 
justly expressed In terms of mental or educational ages for children; fur- 
thermore, even when the comparison is limited to one particular perform- 
ance for which standards have been determined on a large and unselected 
group of children, scores for the adult in terms of mental ages are still mis- 
leading because they imply a qualitative similarity in performances which 
are simply rated as equivalent numerically. 

Another confusion which the mental-age method of expressing test re- 
sults is likely to introduce is that between the average level for a group of 
adults of a wide age range and the age at which the highest average score is 
reached. Because the average scores for the adults of this group with ages 
up to sixty were equivalent to fifteen years and four months on the Com- 
pletion Beta and twelve years and six months on the Pintner Non-Language 
Test, for example, it is not to be supposed that the peak of development is 
reached a little after fifteen on the former and between twelve and thirteen 
on the latter. The Completion Beta showed a correlation of only —.19 ±.13 
with age, the Pintner Non-Language a correlation of —.53 ±.10. In gen- 
eral, the larger the number of older subjects in any group examined, the 
lower the average score and the mental-age equivalent on either test. The 
larger proportion of older subjects, however, would bring the greater re- 
duction in the case of the Pintner Non-Language, though the peak of de- 
velopment on this test might come at fifteen, twenty, or even later. 

A further point which must be noted is that the tests differ greatly in dis- 
criminative power for adults. In reporting the findings it was stated that 
certain of the tests were decidedly inadequate in this regard, notably the 
Morrison McCall Spelling Scale and a number of the non-language tests— 
the Easy Learning, the Two-Figure and the Casuist Formboards, the Ship 

other tests were given had too great difficulty in reading to attempt the Completion Beta} but 
if these cases were added at the Jotvet end of the distribution for the Completion Beta and a 
median calculated, the median would soil &1! at a mental age above 15. In short, the higher 
level on the Completion Beta is not to be explained on the basis of a selected group. 
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Test, and the Healy Picture Completion II. Even among the other tests of 
the battery there are differences in discriminative value. In many cases, of 
course, a piling up of scores at the upper end of the scale means simply that 
the test is too easy for adults. This situation is particularly obvious in the 
ease of the performance tests. The problems they present are so easy that 
the scores for many adults must come close to the physiological speed limit 
for the activities imolvcd. For other types of test different ■explanations for 
the negathely shewed distributions may well be necessary. It has been sug- 
gested that spelling is an activity which the majority of adults, including 
those of only moderate education and ability, have fairly well in command 
and that no process of increasing the difficulty of a test, short of introduc- 
ing catch words, would yield anything but negatively skewed distributions. 
Spelling or any other activity which shows negatively skewed distributions 
may be studied at adult levels for a knowledge of the particular perform- 
ance, but for a study of mental growth or individual differences between 
adults the mere satisfactory indices are, of course, the tests with the higher 
discriminative value. The choice among these tests is a further problem, the 
solution of which will depend on the specific purposes of the study; in some 
cases it may be desirable to test performances like those tested by the Printed 
Analogies which show a definite decline in average scores after early matur- 
ity; in other cases it may be better to test performances such as vocabulary 
which are relatively well maintained through the adult years, and inciden- 
tally it may be remarked that there arc probably many other performances 
of this type for which satisfactory tests have yet to be devised. 

2. COMPARATIVE FINDINGS FOX MEN* AND WOMEN 

In studying sex differences the small size of the group was a disadvan- 
tage, for comparisons had to be made between all males and all females 
rather than between the groups at short age periods. The diversity of the 
tests on which data had been obtained was valuable, however, since it per- 
mitted comparisons on many performances for which sex differences are in 
dispute. 

Analysis of the material not only fails to indicate male superiority on nu- 
merical or spatial tests and female superiority on verbal or memory tests, 
but shows no significant sex differences for any of the tests of the battery. 

In most cases mean scores for the two sexes, are very dose, but it is note- 
worthy that the men are slightly superior to the women on all but three o 
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the important tests. Comparisons of sigmas reveal no general differences in 
variability in the two groups. 

In the relatively small size of the differences the results are on the whole 
in accord with recent work, which has indicated that sc\ differences are less 
marked at adult than at childhood and adolescent levels. Most of the 
studies of adults, however, have shown greater differences between men 
and women than were evident in this research. 

3. ACE AND TEST PERFORMANCE 

Two approaches to the important problem of age and test performance 
have been made: the comparison of mean test scores for different age groups 
and the correlations of the various tests with age. The findings indicate that 
differences in age from twenty to sixty have comparatively little effect on 
most test scores. For the small decade groups there arc no significant dif- 
ferences between mean scores at any one-decade interval or at any two-dec- 
ade interval, and the majority of the tests show only slight and insignificant 
correlations, usually negative, with age. It would not be adequate to con- 
clude, however, that the adult at any age from twenty to sixty stood an 
equally good chance of making the same score on any of the tests of the 
battery. Certain tests show significant differences in mean score between the 
twenties and the fifties, and others show significant differences between 
the larger age groups twenty through thirty-four and forty-five through 
fifty-nine. In general, the evidences of an increase in score with age appear 
only for the vocabulary tests, notably the Thorndike Word Knowledge, 
while the evidences of a decrease with age are most marked in the case of 
the analogies and the non-language tests. The Thorndike Word Knowl- 
edge is the only test showing a reliable positive correlation with age; both 
analogies tests, the Mare and Foal, the Seguin, the Substitution, the Pintner 
Non-Language, and the Porteus Maze show reliable negative correlations 
with age. 

That high points in development apparently occur at different ages and 
that the course with age varies in different test performances are conditions 
of great importance. The fact that scores on most of the language tests, 
and also on the arithmetic tests, are well maintained in the thirties suggests 
that development in these fields continues not only beyond fourteen, six- 
teen, or twenty, but that it may still be evident up to thirty or beyond. It is 
to be noted, however, that while the means for the small decade groups were 
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higher in the thirties than in the twenties in the case of nine of the eleven 
language or arithmetic tests, the peaks of development for the much larger 
groups studied by Willoughby and by Jones and Conrad almost all fell in 
the twenties or before. 

The vocabulary, reading, spelling, and arithmetic tests, on which the 
scores are not only relatively high in the thirties but remain relatively high 
at later ages, may be said to involve activities fairly well within the experi- 
ence of the older as well as the younger adult. It seems natural that vocabu- 
lary should be well maintained and should even improve with age. Inter- 
estingly enough, arithmetic computation, which apparently depends to a 
considerable degree on school training, also re m a in s on a rather even level 
between the twenties and the fifties. There is obvious truth in the sugges- 
tion that these performances are all well maintained in adult years because 
they are within the natural realm of experience for the older person, but the 
possibility that they involve abilities which mature late or decline slowly 
must not be overlooked. 

The early peaks followed by steady and rather rapid decline which appear 
for the analogies and non-language tests, must also depend on several fac- 
tors. It has long been believed that motor skills requiring rapid movements 
were best accomplished by the younger person, and recent investigators 
have contributed more precise information on this point. Miles has shown 
that motor and perceptual abilities, especially those which depend on speed 
rather than diligence, manifest a steady decline after earl) maturity, 
though the declines are not so sharp as those for certain sensory capacities 
and memory and learning abilities. 4 The findings for the patients m is 
research on the Seguin, the Substitution, the Hard Learning, an e re 
and Foal may be considered as supporting the evidence o a stca ) °P*° 

relatively simple activities where perceptual and motor abilities and speed 
arc at a premium. , _ , 

The findings, however, lie those of Willoughby end Jones end Conred, 
elso show steady end rather marked declines in complex non-language tests, 
in svhich speed and skUl in certain relatively simple motor and perceptual 
abilities are not so important, and also in the analogies tests. e op on 


• W. R. MDc. Age eel »~-****g± ^ rf £££*.^1. qaerioarf _ 
8 Th; interpretation of the declines with on wme o . .wjiries. 

■hr smuad list 4c M„ raljcco e„ X^faa 4 siniocu. 

correlation exists between age and score on an oral directions tes ( f 
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certain of the sub tests of the Pintner Non-Language probably could not be 
explained on the basis of decreasing speed or facility in simple motor or 
perceptual operations. The decline on the analogies tests could not be attrib- 
uted to any secondary factor, such as decreasing speed or visual efficiency or 
failure to adapt to a difficult situation. For none of these tests have the per- 
formances been sufficiently well analyzed to provide an explanation, al- 
though the possibility of a spatial factor immediately comes to mind. In 
this research the Printed Analogies repeatedly showed characteristics of 
some of the non language tests, or close relationship to them, but it was not 
possible to determine whether these could well be accounted for by the as- 
sumption of 3 spatial factor. The problem is of considerable importance 
and one which requires further work on the characteristics of normal men- 
tal functioning before the many recent studies of pathological disturbances 
in “spatial thinking” can be rightly interpreted. 

In a cross-section study of this sort it is, of course, impossible to deter- 
mine whether the more intelligent man continues to develop longer than 
the less intelligent; the only statements which can be made are for the gen- 
era! tendencies of average or near-average individuals at different ages. On 
the whole, it is clear that the greatest development in “test intelligence” oc- 
curs before the age of twenty, and that the gains or losses which appear 
from that age to sixty are comparatively very slight. They do occur, how- 
ever, and the losses in some activities, notably in the non-language field, are 
of sufficient magnitude to require the establishment of "normal” levels for 
different age groups in the adult years. 


4. RELATIONSHIPS BETWEEN VARIOUS PERFORMANCE LEVELS IN THE 
INDIVIDUAL CASE 

The extent of the test battery made it possible to study the relationships 
between a large number of performance levels in each individual case— a 

tween age and score on the Ons Test, but that adaptability to new situations (Marsh, op. at, 
detetctucuul by score on a number of performance tests, shows only a slight tendency 
to decrease with age. Apparently, then, k ■* safe to assume that declines on non language 
tests, such as were found tn this research, depended on increasing age rather than on a pro- 
grcssisely poorer adaptation among the older subjects Certainly observations of the patients' 
reactions would support such a conclusion In view of the considerable drop with age in the 
understanding of oral directions, however, it is apparent that age curves for tests requiring 
complex oral directions must be interpreted with caution, and particularly in the case of 
grade-school subjects, who showed a more marked decline than college subjects in the Stan- 
ford Study. 
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been regarded as one of the best indices of intelligence. Nevertheless, the 
extent to which success on this vocabulary test apparently reflects a man’s 
intelligence rather than his education is surprising; only .8 per cent of the 
variation in scores results from the variadon of education while 12.4 per 
cent results from the variadon of test intelligence. 

Among tests of specific school subjects, the reading comprehension and 
the aiithmedc reasoning tests might also be expected to show a closer rela- 



Figure 10. Relations between educational and various test levels. 
One case at third grade level and two at fifth 


don to intelligence than to educadon, for while they demand particular 
skills learned in school, they also demand complex mental functioning 
which characterizes the intelligent person and cannot be instilled into the 
dull. The results confirm this supposition, and show that the performances 
required by the reading comprehension and the arithmetic reasoning tests 
are indeed more closely related to test intelligence than to educadon. 

The activities involved in the other educational achievements analyzed — 
oral reading, spelling, and arithmetic computation— would generally be 
considered more closely dependent on school training. The most striking 
fact of the analysis is that the spelling tests, the Gates Oral Spelling and the 
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Stanford Dictation, arc more closely related to test intelligence than to 
education, 8 while the Gray Oral Reading shows little difference one way 
or the other, and only the Arithmetic Computation reveals a slightly closer 
relation to education than to intelligence. The evidence that the school 
grade reached is a comparatively unimportant factor for success in spelling 



Figure ir. Relations between educational and various test levels. 
Four cases at sixth grade level and one at seventh 


is clear: with Stanford Binet scores partialed out, correlations for the two 
spelling tests with education are reduced from +.59 and +.60 to +.20 
and +.23; with education partialed out, correlations with Stanford Binet 

are reduced only from +.68 and +.74 to and +.57. 

The correlational findings were extremely interesting, ut e groups °_ n 
which the partial correlations had been calculated were so small that It 


• It is worth nodog in this connection tb 
component parts of the Stanford Binet Scale 


tt neither spelling nor writing to dictation a 
,s vocabulary and reading comprehension are. 
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seemed worth while to make an analysis of the relation between educa- 
tion and test performance in the individual cases. Seven tests were selected 
as a basis for the study: the Pintner Non-Language, the Stanford Binet, 
the Thorndike Word Knowledge, the Stanford Dictation, the Thorndike 
McCall Reading, and the Stanford Arithmetic Computation and Reason- 



Figure 12. Relations between educational and various test levels. 
Five cases at eighth grade level 


ing Tests. For each of the 35 subjects who had taken all of these, the sigma 
deviation of each score from the mean for the total group was determined 
and the individual case records were plotted on graphs such as those shown 
in Figures 10 to 14. Sixteen cases showed a difference of less than .3 sigma* 
between the deviation from the mean in educational level and in Stanford 
Binet score, and for simplicity in presentation these have been omitted 
from the final graphs. The remaining 19 subjects, with differences of .3 

_ T Tbis point was selected arbitrarily as the one which gasc the most sanslactorjr distribu- 
tion of cases for clarity in the graphs. 
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sigma or more between the levels in education and in Stanford Binct, are 
the interesting cases from the point of view of the relationship between 
education and test performance. 8 Nine of them made Stanford Binet scores 
.3 sigma or more above their respective educational levels, while the other 
10 made scores .3 sigma or more below. 



Figure 13. Relations between educational and various test levels. 
One case at ninth grade lcsel and two at twelfth 


A study of the graphs will show that the individual case hues follow he 
level set by the Stanford Binet score fir more often than the level set by th 
school grade completed. Among the 9 cases with Stanford s< f r “ 

which are high in relation .0 the educational levels, 5 or pssubly 6 show 

test performances on the Thorndike Word Knowle ge, e tin or 11 

tion, the Arithmetic Reasoning, the Thorndike McCall Reading, and the 


• Tie eduarioml lew! U show. .. *e «- W N ““ ^ 

more than one school grade appear in Figure* r®» **» *3» M **' 
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Arithmetic Computation which are closer to the Stanford Binet than to 
the educational level. 9 These are cases a in Figure 10, a, b, and c in Figure 
11, and a and possibly b in Figure 12. These subjects were apparently able 
to compensate for a relatively poorer education than their intelligence 
probably warranted. Cases b in Figure io, d in Figure 11, and a in Figure 



Figure 14. Relations between educational and various test levels. 
Two cases with equivalent of fourteen years’ schooling 
and one with equivalent of sixteen 


13, on the contrary, each with the Stanford Binet slightly higher than the 
educational level, not only failed to bring up other test scores to match the 

» The Pintner Non-Language Test «as included in the analysis to give in additional index 
of intelligence. Its correlation with education, +.56 ;t.io, is almost as high as that of the 
Stanford Binet. +.6* ± 08. In 7 or the 19 cases the levels on the Bwct and the Pintter are 
less than .5 sigma apart. Among the remaining eases there is one where the various tests fall 
nearer the Pintter than the Binet and 7 others where some of the tests fall nearer the Plotter 
and some nearer the Binet. To a certain degree, therefore, the level on the Pintter may pos- 
sibly indicate abilities which account for the irregularity in eases where the various perform- 
ances do not approximate the level of the Stanford Binet; but so many different factors are, of 
course, involved that it is difficult to estimate the effect of one or two in isolation. 
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Stanford Binet, but for the most part obtained scores relatively inferior to 
the educational level in terms of sigma deviation. 

All 10 subjects in the group having a Stanford Binet score lower than 
the educational level showed performance levels on the vocabulary and 
educational achievement tests which were also as a general rule inferior to 
the educational level. These cases must, however, be considered in relation 
to the part of the scale of ability in which they fall. Cases c in Figure JO, 
e in Figure 1 1, c, d, and c in Figure 12, b and c in Figure 13, and c in Figure 
14 are apparently subjects who were either unable to profit by the extent of 
the educational training they experienced or deteriorated after leaving 
school. These subjects stand in direct opposition to those of the first group 
who seem to have been able to compensate for a relatively poor education. 
Cases a and b in Figure 14, while they appear from the graphs to be subjects 
whose test performances fall considerably short of their educational levels, 
are not to be classed among those who failed to profit by school training or 
deteriorated after school. The situation in these two cases is more correctly 
understood as the result of inadequacies in the tests employed; both sub- 
jects came close to the upper limits on most of the tests and would un- 
doubtedly have shown less discrepancy between sigma deviation in test 
performance and sigma deviation in education if the tests had been more 
difficult. 10 

It goes without saying that the many factors involved make the signifi- 
cance of the findings somewhat problematical. For a decisive conclusion 
the analyses would, of course, have to be carried out on larger groups in 
which the factors of age, type of occupation, general environment, and 
even marked differences in personality were controlled as far as possible. 


10 The scores for these two subjects, one with fourteen and 
ing, were as follows: 

Pintncr Non-Language ... 

Stanford Binet .... 

Thorndike Word Knowledge 

Stanford Dictation ....... • 

Arithmetic Reasoning .... • • 

Thorndike McCall Reading 

Arithmetic Computation 


with sixteen je 

. . . 500 

. 1 96 

■ . 73 

. . ri5 

■ • 34 

. . 3<J 

. - 37 


s' school- 

Caseb 

4S0 

9« 

i*4 

35 

3» 

45 


It must be noted that the scores of subjects like these, for whom the tests tvere not adequate, 
would decrease the correlations between education and test performance. There were ew sue 
subjects, however, and for the group as a whole the tests were frilly satisfactory from the 
point of view of discriminative power, so that the correlations are probably little affected. 

11 There is little difference in type or level of occupation between the intelligent subjects 
with poor education and relatively high test performances and the duller subjects with good 
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Nevertheless the findings strongly suggest that those adults who had pro- 
gressed further in school than their intelligence probably warranted had 
never acquired or had forgotten much of what they were taught in funda- 
mental school subjects, while those who had not continued in school so 
far as their intelligence warranted yet had by some means acquired an 
amount of formal knowledge commensurate with their respective intelli- 
gence levels. 

The tests used in this research are to a certain extent unsatisfactory for 
a complete study of the problem, not only because of their somewhat lim- 
ited discriminative power but also because they do not cover a sufficiently 
wide range of performances. It may be said that the individuals who re- 
main in school longer, regardless of intellectual level, acquire more exten- 
sive knowledge than those who leave early, for example, in the fields of 
history, science, or literature. It would be surprising, however, if those 
who remained in school and profited by these studies did not continue to 
improve in subjects tested in this research, notably in vocabulary and the 
understanding of reading. In other words, tests of these subjects are at least 
partial indices of educational development beyond the stage of the three 
R’s, and the results indicate clearly that success on these tests has a much 
closer relation to intelligence than to education. There is, of course, still 
the possibility that those who remain in school longer, the dull as well as 
the bright, gain in subtler values which are not demonstrable by tests, but 
impressions gained from close work with the subjects did not relate such 
values to length of school training. 

The trend of the results gives one pause, particularly in view of the sums 
spent to provide eight or more years of formal school training for each 
child. It seems clear, as educators and psychologists have long contended, 
that the adequate determination of the child’s abilities is an essential part 
of any educational program, and that the nature and extent of the training 
should be adapted to these abilities. In this research there were clear ex- 
amples of persons who were intellectually unable to profit by more than 
a certain minimum of the ordinary school curriculum, and yet had been 

education but relatively poor test performances. The subjects of the first class included a 
weaver, a rigger, a huckster, a receiving clerk, and a clerical worker; those of the second class 
a bartender, a hat blocker, two clerks, a sewing-machine operator, an assistant forewoman, 
and an automobile mechanic. In other words, within the limits of the group analyzed here, 
there is no indication that differences in test performances m adult years depend on the char- 
acter of the daily occupation. Occupation, like education, is undoubtedly a (actor and there 
are, of course, other factors as w ell, but in all probability none of them is so important as 
intelligence. 
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exposed to this sort of training for as muds as eight years. They had not 
yet enjoyed and probably never would enjoy any but the simplest skills in 
reading, writing, and arithmetic, and there is little doubt that a good part 
of their school years should have been spent on other types of training, 
vocational or nvoeatinnal. The individuals in the other class, who had re- 
ceived less formal schooling than they could have absorbed, may leave one 
content with a laiites ftire doctrine, but it must be noted that these adults 
wiio did well in spite of a short school training would probably have done 
still better if they had been carried through an educational experience more 
nearly in accord with their intellectual ability. 

6. OCCUPATION AND TEST PERFORMANCE 

When the groups of the Taussig Scale are combined to form three— the 
first, professional, semi-professional, and business; the second, skilled labor; 
and the third, semi-skilled and unskilled labor — Chi-Square determinations 
show a definite association between success on three of the tests and the 
professional and business group. These three tests are all verbal: the Com- 
pletion Beta, the Stanford Binet Vocabulary-, and the Thorndike McCall 
Reading Scale. The contingency table for the Completion Beta is particu- 
larly interesting: only one case from the professional or business group 
made a score below the mean while 1 0 of the 20 cases of semi-skilfed and 
unskilled labor made scores below the mean. None of the non-language 
tests was found to have any definite association with any of the Taussig 
groups. On the basis of the Chi-Square determinations, then, it is possible 
to say that members of the professional and business group arc definitely 
superior to the other two groups in the abilities involved in the Completion 
Beta, Stanford Binet Vocabulary, and Thorndike McCall Reading Scale. 
They are not superior in the abiliucs sampled by the non-language tests, 
and at the same time it is noteworthy that the skilled laborers do not excel 
the other groups here. 

The correlations between test scores and Barr Scale ratings arc rather 
low in view of the fact that die ratings were assigned on the basis of the 
intelligence supposedly necessary to carry on the given occupation. The two 
highest of those calculated — the correlations with the Arithmetic Reasoning 
and the Thorndike Word Knowledge— were about +£0, while that for 
the Stanford Binet was only +-44, and those for the Pintner Non-Language 
and Porteus Maze were lower and unreliable. 
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7. RELATIONSHIPS BETWEEN VARIOUS TEST PERFORMANCES 


Statements as to the relationship between various test performances must 
necessarily be based on the intercorrelations, for the comparatively small 
number of subjects taking all the more important tests precluded any fur- 
ther statistical analysis. It is impossible therefore to make any definite state- 
ments as to the mental functions involved or their relative prominence. The 
days have long gone by when each investigator could suit his own fancy 
in stating what his test tested, but it is to be noted that even now his sta- 
tistical analysis of mental functions leaves him considerable latitude in 
naming them, atld that there is still room for speculations on the basis of 
such material as is available in this study. 

All but one of the test performances for which calculations were made 
were found to be positively correlated, the exception being the slight nega- 
tive correlation between the Thorndike Test of Word Knowledge and the 
Porteus Maze. While some of the correlations with the more specific non- 
verbal tests were low and unreliable, the majority of the correlations were 
moderately high. The fairly close relationships thus indicated apparently 
depended rather on the functions involved than on the wide age range of 
the group, for when age was partialed out in a number of crucial cases the 
correlations remained about the same. When educational level was par- 
tialed out, however, some of them dropped about ten to twenty points. 

The intercorrelations of the tests of the language group discussed in 
Chapter VIII are in general the highest of the lot, and arc of sufficient 
magnitude, even if educational level is partialed out, to show clearly that 
there is a fairly close relationship between the functions tested by the Stan- 
ford Binet, the Completion Beta, the Stanford Binet Vocabulary, the 
Thorndike Word Knowledge, the Thorndike McCall Reading, the Gates 
Oral Spelling, and the Stanford Dictation Tests . 13 Among the intercorrela- 
tions in this group the highest appear between tests supposedly sampling 
the same type of activity, that is, between the two spelling tests and the 
two vocabulary tests . 13 On this point, at least, the correlations support the 


la For simplicity in presentation the statements made in dm and the following paragraphs 
are based on the most important tests of the batter)- In general, however, they apply to all of 
the tests reported. For example, the Oral Absurdities shows the characteristics of the other 
language tests reported here from the point of view of the correlational findings. Similarly, 
the Seguln and the test of Drawing a Chair show characteristic like those of the other 
nther specific non-verbal tests, for example, the Goodenmigh Drawing. 

11 High correlations also appear ****'«' ** Chl ?‘ 

man Reading-Comprehension, and between the Oral and Printed Analogies Tests. 
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logical deductions which one would make as to the nature of the abilities 
involved. In order of degree the next correlations are those between the 
reading comprehension tests and certain of the so-called language intelli- 
gence tests. These arc of sufTicient magnitude to indicate that the functions 
involved are definitely related. It seems clear that for the adults of this 
group there is little distinction between the language intelligence tests and 
the reading comprehension tests, cither in their intcrcorrclations or in their 
correlations with other tests. While there is evidence of similarity in the 
function or functions involved in all these tests, the definition of these 
functions is, of course, another matter. The most obvious guess is that all 
these performances involve verbal ability and that the major part of the 
relationship is probably to be accounted for on this basis. 

When the tests of the battcry'were first grouped the analogies were pur 
with the “language intelligence'* tests, for they obviously involved word 
knowledge and facility in the production of linguistic symbols under cer- 
tain controlling conditions. All the evidence showed, however, that these 
tests stood between the verbal and the non-verbal. They bore moderately 
close relationships to the tests of the language group, although not so close 
as the interrelationships among these tests, and at the same time they were 
fairly closely related to the arithmetic and the non-verbal tests. They showed 
particularly low correlations with the Thorndike Word Knowledge, a fact 
which seems to indicate that the analogies test is not so much a test of 
vocabulary as has sometimes been supposed. What the functions arc which 
set the analogies a little apart from the other language tests is, of course, not 
known; but it may possibly be, as already suggested, that they involve a 
spatial factor and that this is the explanation for their relationship to some 
of the non-verbal tests, notably the Porteus Maze. 

A further consideration of the correlations between the tests in the lan- 
guage group and those outside it suggests that there are definite relation- 
ships between the functions tested by the language and. by the arithmetic 
tests, and that these arc closer in the case of the Arithmetic Reasoning Test. 

As to the Pintner Non-Language and the Porteus Maze, there are still 
definite relationships with the Stanford Binet bur, in general, low and often 
insignificant relationships with the other tests of the language group. It is 
evident that these two tests involve functions considerably different from 
those tested by the language tests, and also by the arithmetic tests, and the 
irregularity in the series of correlations with the Pintner Non-LaQguage 
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and with the Porteus Maze would suggest the presence of several different 
factors. 

The Goodenough Drawing, as well as other non-verbal tests of the bat- 
tery, apparently involves different and probably fairly specific factors, bear- 
ing little relationship to the language or the arithmetic tests. These non- 
verbal tests show certain relationships to the Pintner Non-Language and 
some of its sub-tests, and certain interrelationships among themselves, par- 
ticularly in the case of tests of apparently similar processes such as the Hard 
Learning and the Substitution. On the whole, however, these tests clearly 
involve more specific functions than do the verbal, and this is probably the 
most that can be said about them at present, for it is impossible to make 
any immediate determinations of the functions actually involved. 



X 

CONCLUSIONS 


T HIS research has tapped a new and valuable source for die study of 
adult intelligence. Too little advantage has jet been taken of the 
opportunity the hospital offers for detailed psychological studies of adults 
from the middle levels of the population. Within the hospital group there 
arc undoubtedly many types and degrees of abnormality in mental func- 
tioning, but there are also patients who may safely be said to be "normal’* 
mentally. Among them there are cases requiring hospitalization for ttvo 
weeks or longer, and such patients are practically unique among adult sub- 
jects in that their time is comparatively unlimited and most of them are 
willing to spend a large amount of it on the examinations. They may be 
given not one but many lesti, with the result that a much more complete 
knowledge of the individual's mental functioning can be obtained than is 
usually the case. Furthermore, they may be given repeated examinations 
without fear that their interest will /lag; svhen the studies are properly con- 
ducted, interest in the examinations as well as friendliness with the exam- 
iner usually increases steadily throughout the work. It is true that indi- 
vidual studies are costly, but they are rewarded by the certainty that the 
tests have served their purpose fully and that they have yielded not only a 
score value for the particular performance, but an opportunity for observing 
how that performance was carried through and what difficulties stood in 
its way. 

The hospital group as a whole is the only single group which comprises 
all social and economic strata. The ward patients, who formed the subjects 
in this research, constitute a more limited sample, but a sample which has 
every indication of being truly representative of the middle levels of the 
population. It has been shown to have approximately the same occupational 
distribution as the population of Philadelphia ; it seems to be close to the 
average of the Army population with regard to the grade of intelligence 
required by the occupations represented; and it shows a distribution for 
education similar to the Army distribution. 

The correlation between intelligence and the predisposition to disease or 
environmental conditions favorable for disease is still an unknown quantity. 

If it is negative the consequence would naturally be that hospital patients, 
because of the disease conditions, tended toward inferiority in intelligence. 
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z. Vocabulary tests arc well suited for work with adult subjects and 
readily accepted by them. In use with the normal subjects in this research 
there was little to choose between the two vocabulary tests from die point 
of view of their discrimination within the group or their relation to other 
tests. The Stanford Binet Vocabulary, however, naturally had the advan- 
tage over the Thorndike Test of Word Knowledge in providing an oppor- 
tunity for study of the patient's reactions and attitudes. 

3. The mixed analogies test in the forms arranged in this research differ- 
entiates fairly well among “average" adults, is readily accepted by them, and 
because of its apparent relationships with the non-verbal tests and its 
marked decline with age is one of the most interesting theoretically. 

4. The absurdities test is also well adapted for use with adults, but is not 
satisfactory in the forms employed with these subjects and needs further 
development. 

5. Many of the educational achievement tests employed were satisfactory 
in their existing forms. Both tests of reading comprehension, the Thorn- 
dike McCall and the Chapman, were adequate from the point of view of 
their discriminative power, but the former suffered a little in use with 
adults because of too childish content. Both the arithmetic tests, the Stan- 
ford Achievement Computation and the Reasoning, were found to be very 
satisfactory in discriminating among these adults. The Gray Oral Reading 
Test, the Gates Oral Spelling, and the Stanford Dictation each showed a 
piling up of scores at the upper end of the scale, probably not so much 
because the tests were too easy as because the activities involved are fairly 
well acquired by adult subjects, even by those of only moderate education 
and ability. The Morrison McCall Spelling Scale was obviously too simple 
for adults'from the middle levels of the population, but it is well adapted 
for work with low-grade or pathological cases. 

6. Many of the non-verbal or performance tests at present available fail to 
discriminate among normal adults. Most of the tests of the Pintner Pater- 
son Performance Scale fall in this class. The data on these tests will not be 
useful in connection with further studies of normal adults, but have a 
definite value in studies of deteriorated adults whose level is low enough 
to require comparatively simple tests and whose deterioration may be esti- 
mated roughly by reference to the median and quartlles for normal adults. 

7. Other non-verbal tests, notably those of the Pintner Non-Language, 

1 Wciseoburg and McBride, op. fit.. 577-578. 
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arc more satisfactory in differentiating among “average” adults and con- 
tribute data on abilities which are important at adult as at childhood levels. 
There arc not enough of these tests, however, or rather they do not cover 
all the types of performance which should be studied, for example, form 
or picture analogies. 

8. The Goodenough Drawing Test presented an assignment unwelcome 
to many adults, and was undoubtedly a different type of problem for them 
than for the child, with success on the test much less closely related to in- 
telligence than among children. Experience with the test of Drawing a 
Chair indicated that this too was a difficult problem, and suggested that 
studies of drawing in adults run up against very variable factors, emotional 
as well as intellectual, and must be interpreted with caution. 

The analysis of the results for their bearing on the question of age and 
test performance is one of the most interesting sections of the work. The 
first conclusion is that by far the greatest extent of mental development as 
indicated by these test performances has occurred before the twenties, and 
that from this decade through the fifties there is litde further gain and com- 
paratively little decline. While the magnitude of the development before 
twenty fhr outweighs any change between twenty and sixty, smaller changes 
do occur in the adult period. As they appear in specific types of mental 
aedvities, these changes are comparatively little known, but are none the 
less important, for example, in estimating the relauve value of the worker 
in a given occupation at twenty and at fifty, or the value of educational 
projects for the “average man” in earlier or later decades. The results of 
this study show clearly that the peak of development and the course with 
age are different for different performances. A survey of the findings sug- 
gests two conclusions which may be briefly stated as follows : 

1. The abilities sampled by most of the soiled language tests are well 
maintained through the thirties and in the majority of cases show only 
slight declines thereafter. The abilities sampled by the vocabulary tests 
show little or no decline through the fifties. 

2 . The abiliues sampled by another group of tests show an early peak, 
probably falling either in the twenties or before, with successive declines 
in the thirties, forties, and fifties. These declines are not significant at any 
one-decade or two-decade interval, but most of them are significant for the 
interval between twenty and fifty or for the larger age groups twenty 
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through thirty-four and forty-five through fifty-nine. A point of great 
interest is that all but one of these tests are of the non-language type, while 
the tests which do not show a drop in mean score before the thirties are 
language or arithmetic tests. The only ''language” test in the group char- 
acterized by steady declines with age is the Printed Analogies, which re- 
peatedly manifested a close relationship to the non-language tests. Declines 
with age on some of the non-language tests probably reflected the generally 
accepted declines in simpler motor and perceptual abilities and particu- 
larly in those in which speed was a dominant factor. Declines in others, 
notably the Porteus Maze and the more complex tests of the Pintner Non- 
Language Scale, and also in the Printed Analogies, could not be explained 
on this basis. Furthermore, they were apparently not to be attributed to a 
greater difficulty among the older subjects in adapting to the test situations, 
for such difficulty was not observed in the situation itself. It is suggested 
rather that these tests probably involve spatial factors and that the declines 
arc to be understood on this basis. 

From the point of view of the relation of education and test perform- 
ance, the findings indicated that the amount of formal schooling was 
probably not such an important factor as it is generally supposed to be. An 
analysis of individual cases revealed that the intelligent men with poor 
school training had effected some compensation by adult life, while the dull 
individuals who had received more training in the fundamental school 
subjects than their intelligence warranted cither had never profited by it 
or had forgotten most of what they once knew. It would not be wise to 
lay too great stress on this point in view of the small size of the group, and 
also in view of the fact that at least one performance, arithmetic computa- 
tion, showed a slightly closer relation to education than to test intelligence. 

It is important to note, however, that scores on all the other educational 
achievement” tests analyzed bore a closer relation to the test intelligence 
than to the school grade completed, and that this relationship hefd even 
for the spelling tests, the skills involved in which are definitely school 
trained. The results furnish additional proof, though such is hardly neces- 
sary, of the wisdom which lies in studying the mental abilities of the school 
child and making the nature and extent of his training fit his potentialities 
for development. 

A study of test performance in relation to the occupational classes of the 
Taussig Scale indicated a condition which is easily accepted on the basis of 
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vtously involved verbal material, but is apparendy as closely related to the 
arithmetic and non-verbal as to the verbal tests. The Arithmetic Compu- 
tation Test also occupies a sort of mid-way position. 

The evidence for a distinction between different groups of mental 
abilities gains additional support from the findings for the pathological 
groups studied by the same tests. 2 Cases of right-sided cerebral lesion with- 
out aphasia or history of it were found to resemble the normal most closely 
in sentence dictation, oral spelling, and oral vocabulary, and to be signifi- 
cantly inferior to the normal only on the arithmetic and some of the non- 
language tests. These non-aphasic cerebral cases represent a sample in- 
ferior to the normal, but this condition does not explain the marked in- 
equalities in their performance levels. Language abilities are apparently 
well maintained while the abilities required by the arithmetic and most 
non-language tests show evidences of deterioration. 

The distinction between language and non-language is more pronounced 
in typical cases of aphasia; and here the opposite condition is to be found, 
that is, the language activities are more seriously affected than the non- 
language. Interestingly enough, the arithmetic activities usually stand in 
between. Some few patients whose language processes are extremely lim- 
ited or confused so that they speak and understand only a few words cor- 
rectly nevertheless do better than the average of the normal group on non- 
language tests, and may also do moderately well in arithmetic. Others, the 
great majority, show some deterioration in non-langu3ge activities and in 
arithmetic, but far less than in language. Only a few aphasic patients have 
anything like as great difficulty in non-language as in language work. In 
short, aphasia is a deterioration which usually extends beyond language 
processes and involves activities which do not require overt verbal responses, 
but the changes in non-language performances arc less marked than those 
in language and in some cases the differentiation is clear enough for one to 
say that the aphasic patient has lost one form of intelligent response, the 
verbal, and retained other forms which do not require language. 

Uncertain as reasoning from the pathological to the normal must be, it 
seems probable that the characteristics of cases of cerebral lesion, with and 
without aphasia, give certain new cues for an understanding of normal 
mental functioning. Despite the positive correlations found for the normal 
group between almost all the many activities studied and despite the fact 


* Ibid . chapj. XVI and XVIII. 
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that analyses of individual cases revealed few marked constellations of 
mental test performances, there are in all probability more or less inde- 
pendent groups of mental abilities. The typical normal adult docs not show 
great differences in the development of these different abilities; in so far as 
performances on tests of intelligence and educational achievement may be 
taken as indices, the various abilities all fall at fairly similar levels. In cases 
of brain disease, however, or at least in cases of localized brain disease, they 
may be affected unequally. As to the nature of these groups of mental 
abilities and their interrelationships, there is still little precise knowledge 
and it may well be that groups as yet undefined will have to be postulated 
to explain results such as those found in this research. For the present it 
seems dear from this study and others, however, that there is probably a 
verbal ability of a rather extensive nature. Numerical and spatial abilities 
have also been indicated, particularly in Kelley's work, and the necessity 
for some such groups as these is suggested by the findings for both normal 
and pathological subjects. 
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DESCRIPTION OF THE TESTS 


1. Language Intelligence Tests 

a. Controlled association tests 

b. Kelley Trabue Completion Exercise Beta 

c. Vocabulary tests 

d. Stanford Binet Scale 

e. Miscellaneous tests 

2. Educational Achievement Tests 

a. Reading tests 

b. Spelling tests 

c. Writing a Letter 

d. Arithmetic tests 

3. Non-Language Tests 

a. Tests from the Pintner Paterson Performance Scale 

b. Pintner Non-Language Mental Test 

c. Cancellation 

d. Immediate Recall: Kuhlmann Binet 

e. Drawing a Chair 

f. Goodenough Drawing 

g. Porteus Maze Test 


I. LANGUAGE INTELLIGENCE TESTS 
a. Controlled association tests 

opposites tests. The Opposites Tests used were chosen for the aphasia study 
and were too easy for normal adults. 1 

mixed analogies tests. Two forms of the Van Wagenen Graded AmIog.es 
Tests were selected for the Oral and Printed Analogies Tests, B for the Oral 
and A lor the Printed. 2 , ,, ■ , 

The instructions were approximately the same as those ou me y an 
Wagenen, except that the subject was asked to give his response orally. 

I am going to say three words. I want you to tell me a fourth word that has 
1 For lists of tumulus word, and procedure tec Wcitcnburg and McBride, op. tit.. 581- 
1 M. J. Van Wagenen, Grated Opposite r tad dnalogie s Tests, J. Educ Psyehot, 19*0, n. 



LANGUAGE INTELLIGENCE TESTS 131 

the same relation to the third word I say as the second has to the first. Here’s 

an example: color is to red as name is to John. Here’s another: page is 

to boo\ as handle is to \nife. See if you can tell me the fourth word for 

this one: chew is to teeth as smell is to . (If the subject could not tell the 

fourth word, it was told him, and the directions were given again.) 

The successive items of the Oral Test were then read aloud, and the subject 
gave his responses orally. If both Oral and Printed Tests were used, the latter 
was given second and with no further instructions except that the subject was 
asked to read the words instead of listening to them. He responded orally. If the 
Primed Analogies were used alone, printed directions were given and for con- 
venience these were placed at the top of the test sheet. 

Both tests were timed, but the score was taken as the total number of correct 
responses. 

As given, each test included 50 items 3 but some of these were of doubtful 
value, either because subjects sometimes misunderstood them or because of diffi- 
culty in assigning a plus or minus score. When doubtful items had been elimi- 
nated the Oral Test contained 36 analogies, the Printed, 40. Analysis of the diffi- 
culty value of each item showed some overlapping, that is, two or more items 
falling at the same difficulty value. One item of such a group was cut from the 
Oral Test, and five from five different groups from the Printed. Each scale then 
contained 35 Items. The two lists, each ranked in order of difficulty, are given be- 
low, together with the accepted response or responses to each analogy. 


Oral Analogies Test 

1. horn is to blow as bell is to (ring, clamor) 

2. rain is to summer as snow is to (winter) 

3. vinegar is to sour as sugar is to (sweet) 

4. work is to day as sleep is to (night) 

5. baker is to bread as bees are to (honey) 

6. rug is to floor as pictures arc to (wall) 

7. iron is to heavy as aluminum is to (light) 

8. year is to month as week is to - (day) 

9. high is to low as near is to (far) 

10. stove is to heat as lamp is to (light) 

it. raise is to tower as open is to (shut, close) 

la. eat is to food as wear is to (clothes) 

13. country is to road as city is to - (street) 

14. under is to over as down is to - (“P> 

15. navy is to sailors as army is to — (soldiers; 

16. foot is to leg as hand is to (arm) 


» One change on the Oral Form was made before the test " 

tine: " was changed to "knife • blade ° • 

problem. "Prong” and "points'' were accept as ry 


given: Item "blade : kniic :: 
i to simplify the vocabulary 
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17. sweet is to taste as red is to (color) 

18. silver is to tarnish as iron is to (rust) 

19. front is to back as top is to (bottom) 

20. hard is to soft as rough is to (smooth) 

21. triangle is to three as square is to (four) 

23. fair is to cloudy as sunshine is to (rain) 

23. fruit is to basket as water is to (pail, bucket, bottle, pitcher) 

24. box is to wood as botde is to (glass) 

25. wall is to paper as floor is to (carpet, rug, paint, varnish, stain) 

26. coal is to black as gold is to ■■ (yellow) 

27. foot is to ankle as hand is to (wrist) 

28. man is to legs as carriage is to (wheels) 

29. leg is to knee as arm is to (elbow) 

30. kettle is to utensil as chair is to (furniture) 

31. park is to gate as house is to (door) 

32. knife is to blade as fork is to (tine, prong, points) 

33. number is to figures as word is to (letters) 

34. grain is to wheat as fruit is to (any kind of fruit) 


35. house is to door as field is to (gate) 


Printed Analogies Test 

Write a fourth word that has the same relation to the third word in the 
row that the second word has to the first word in the row: 
color red name JOHN 
page book handle KNIFE 
chew teeth smell 

1. birds fly fish (swim) 

2. summer hot winter (cold) 

3. light flay dark (night) 

4. sun shines wind (blows) 

5. Ether son mother (daughter) 

6. hat head shoe (foot, feet) 

7. barn hay library (books) 

8 . cup saucer knife (fork) 

9. cats scratch bees (sting) 

10. food eat books (read) 

11. hair black eyes (blue, brown, black, etc.) 

12. man husband woman (wife) 

13. thermometer temperature clock (time) 

14. clothes tear dishes (break) 

15. water drink air (breathe) 

16. lamp o 3 stove (coal, fuel, wood) 

17. happy laugh sad (cry, weep) 

18. potato s egctable seal (meat) 
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19. far near there (here) 

20. sparrow bird mosquito (insect, bug) 

21. bread flour candy (sugar, molasses) 

22. city mayor state — (governor) 

23. water glass coffee (cup) 

24. sight blind hearing (deaf) 

25. July month Friday (day) 

26. slipper shoe cap (hat) 

27. cheap many cosdy (few) 

28. air birds water (fish) 

29. complex difficult simple (easy) 

30- month week day (hour) 

3 1 * present known future (unknown) 

32. mail write telephone (talk) 

33- picture frame field — (fence, wall) 

34. victory defeat success (failure) 

35* work problems play (games) 

b. Kelley Trabue Completion Exercise Beta 
The Kelley Trabue Completion Exercise Beta was given and scored accord- 
,n S to Kelley’s revision. 4 It was suitable for almost all the subjects of the normal 
group, with the exception of a few who had great difficulty in reading, and for 
these it was not used. 


c. Vocabulary tests 

Stanford binet vocabulary test. This test was given and scored in the regu- 
lar way. 5 No evidence that subjects had had experience with this test before was 
ever found, although it has been published in popular magazines and news- 
papers. 

Thorndike test of word knowledge. This test, a multiple choice type of vo- 
cabulary test, was given and scored according to Thorndike’s directions.® It was 
not used in the case of those few subjects who had great difficulty in reading. 

d. Stanford Binet Scale 

The Stanford Binet Scale was given and scored according to Terman’s dtrec- 

4 Kelley, InJittdual Testing with Completion Test Exerciser. Teach Coll. Rec . 1917, 18, 
371-382. 

When the study was begun the Completion Exercises could be obtained from the Teachers 
College Bureau of Publications When they went out of print shortly afterward, the tests were 
nwltigraphed, but they may be had Com C. H Smelting and Company, Ch.cago 

* Tecmm, The Measurement of MelUgnce , 9l 6, 224-23 1. 

•Published by Bureau of Publications. Teachers College, Columbia University. A sheet of 
directions for giving and scoring the test comes with the test blanks. 
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dons. 7 A complete test was always made but the Code Test, Year XVI, as noted 
in Chapter IV, was usually omitted because it was not well received by the adult 
subjects. All tests were given from the X-year level through the level at which 
no test was passed; in cases of one or more failures at X years, the earlier tests 
were given down to the age at which all tests were passed. In no case did the 
testing have to be extended below the V-year level. 

e. Miscellaneous tests 

absurdities tests. These were composed of absurd statements, chiefly from 
among those suggested by Ballard. 8 They were divided into two groups by select- 
ing alternate items; one group was presented orally and the other in print. 
After a few trials the numbers in each group were cut to 12, with the elimination 
of those items which caused difficulty. When all the tests had been made, the 
responses were scored by the three psychologists working together. Two more 
items were cut, one from each series; both of them were often misleading to 
good as well as poor subjects. 

The following directions were used in giving the Oral Test: 

I am going to read you a paragraph that has something foolish about it, 
something absurd. When I’ve finished, I want you to tell me what is foolish 
about it. What is foolish about this sentence? “A soldier in the march com- 
plained that every man was out of step except himself” (If the subject could 
not indicate the absurdity, it was explained to him.) 

If the Printed Test were given later no further directions were added. For 
convenience in cases where the Printed Test alone was used, similar direction* 
were added at the top of the printed page. 

Oral Absurdities Test 

x. A man asked a boy where Mr. Smith lived. He said: “The first house you 
come to is a barn and the next is a haystack — the next is Mr. Snu * . 

2. A gentleman fell from his carriage and broke his neck, but receis no 

further damage. ... «. 

3. The judge said to the prisoner: "You are to be hanged, an opc 
will be a warning to you.” 

4. When there is a collision the last car of the train is 

so the guard thinks it would be best if the last car were : 
the train starts. . . .« r . 

5. A boy wrote on his composition: "Soap smells nice, but it tastes orri 
tastes worst of all when you get it in your eye.” 

•P. B. rhe Growth of InttWgenct. Brit. I- Piychol, Geo. Sec, ijai, 

13 , 135-141. , 

Ballard quote* one of Whipple'* and one of Yerke*' mu, both of which were ujeu. 


usually damaged most, 
alwavs taken off before 
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6. An old gentleman complained that he could no longer walk around the 
park as he used to; he could now go only halfway round and back again. 

7. A householder saw an advertisement: "Buy one of Simkin’s stoves and 
save half your coal." He bought two in order to save all of it. 

8. I am not conceited, but I don’t think I'm half as clever as I really am. 

9. You are thin 3nd I am thin, but he is thinner than both of us put together, 
to. A showman advertised for 3 giant and a dwarf A man of ordinary height 
presented himself and offered to fill both parts. He claimed to be the smallest 
giant in the world and the biggest dwarf 

11. Every rule, even this one, has an exception. 


Printed Absurdities Test 

Read this sentence and decide what is foolish about it. 

A soldier in the march complained that every man was out of step except 
himself 

What is foolish about each of these paragraphs? 

1. I like end slices of bread. I gave the girl a whole loaf of bread and told 
her to bring me the two end slices. I afterward found that she had sliced the 
entire loaf I asked her why she did this. She said: “How could I get the second 
end piece unless I did?” 

2. I received a letter from a friend in which he said: “If you don’t get this 
letter, just let me know and I’ll write again.” 

3 - I read in the paper that they fired two shots at a man. The first shot killed 
him but the second one didn’t. 

4. This morning I met a smart young man. He was walking down the street 
with his hands in pockets and twirling a brand new walking stick. 

5 * An Irishman called one day at the post office and asked if there was a 
letter waiting for him. “What i5 your name?” asked the postmaster. “Sure," 
said the man, “you will find my name on the envelope.” 

6. It is said that a certain town in Greece contains two relics of St. Paul; one 
his skull when he was a boy and the other h.s skull when he was a man. 

7 - A teacher said to his boys: “Tomorrow we will have an examination, at- 
tendance at which is voluntary; so if any boy is absent he d better look out. 

8. A man said to his shoemaker: “You blockhead’ I told you to make one 
of the shoes larger than the other, and instead of that you have made one of 
them smaller than the other.” 

9 - There is a tree in too so « o 1 ” a “ <*“ 

,o P T!,c three men laughed, then .'°PFd -uddenly as the eyes of each met 

those of the others across the table- . r 

... The horse obeys his master because h.s eyes magn.fy so that h.s master 
seems to the ho.se to be much huge. •>“ h °'“ himself 
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The items were first scored by allowing two points for a correct response and 
one point for a partially correct response, but differentiations between the cor- 
rect, the partially correct, and the zero responses were difficult, and a plus or 
minus scoring system was finally adopted. In general correct answers are those 
stating the absurdity, those correcting the absurdity, or commenting upon it in ' 
such a way as to show its clear recognition, of, less good, answers definitely 
implying the absurdity. Incorrect answers represent lor the most part incorrect 
criticisms or interpretations, irrelevant comments, answers denying the ab- 
surdity, answers restating it, or complete failure to respond. Examples of answers 
considered correct are given below. 


Scoring Guide for Oral Absurdities Test 

1. How could a haystack be a house? 

Why didn't he say the first house was where Mr. Smith lived? 

The first “house” would not be a house; it would be a barn or a haystack. 
1. He’d be dead if he broke his neck. 

That' s damage enough. 

3. What was the object of warning him if he was to be hanged? 

Too late to warn him. 

He didn’t need a warning. He’d be a warning to others. 

That was his finish. 

He expected him to live, I suppose, after he was hanged! 

4. There would always be a last ear. 

If the last car is taken off, the car next to the last would be wrecked. 

You would have no cars on the train then. 


5. Can’t taste in your eye. 

It burns your eye. 

6. That is the same as all the way round. 

He was still able to walk around just as far. 

7. He was burning twice as much. 

He’d have to burn some coal if he wanted heat. 

He should have bought three stoves to get some back! 

8. He is conceited when he says he thinks he’s clever. 

He is contradicting himself there. 

He is conceited. 

9. Two put together would be fatter not thinnef- 

He couldn't compare the mo of os; we would ™ke Urger •>>» •>' “• 
He would have to be fat then. . . 

Comparing the thinness of one person to two >s idiotic. . 

10. Impossible for a man of normal stature to assume at will the p po 

of a dwarf and a giant. .... t c u. 

If he wants a giant he must get a big man, and if he wants a 
must get a small man. 
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He was neither. 

He couldn’t be both at the same time, 
il. It would have to indude that one; then every rule docs not have an 
exception. 

If every rule, then that one the same thing, and then it’s not true. 

Scoring for Printed Absurdities 

i. She should have gotten both end pieces by slicing both ends off, not by 
cutting the whole loaf. 

Foolish to cut the whole loaf. 

All she had to do was to turn both ends around. 

а. You would have to get the letter to let him know. 

How could the person know if they didn't receive the letter? 

3. He was already dead with the first shot. 

He couldn't be killed twice. 

The second one couldn’t. 

They would not have to shoot the second shot. 

Not necessary for a second to kill him after one did. 

4. Impossible to twirl a stick with his hands in his pockets. 

5. The postmaster could not give the Irishman his letter if he did not know 
his name. 

He would not know what name to look for. 

He wouldn’t be able to tell which envelope. 

б. Two skulls for St. Paul; that’s one too many. 

There couldn’t have been two skulls. 

St. Paul lived to be a man. Couldn’t get skull when he was a boy. 

St. Paul had only one skull and the relic part is a fake. 

7. If attendance is voluntary, why need the boys look out? 

Voluntary is foolish; it was really compulsory. 

That was a threat; it wasn’t voluntary. 

Voluntary means to do what you please, so if a boy is absent he would 
be doing what the teacher said. 

8. That’s the same thing. 

Just made the job right. 

He did what he told him to do already, if he did not tell hun which shoe 
to make larger. 

One would be smaller anyway. 

9. If any of the three had ordinary eyesight, they could see the top alone. 

It would not take but one person to see the top of the tree. 

The boy could sec as high as two men and a boy. 

Why does it take three people to sec the top? That s foolish. 

How do they relay that seeing game? 

10. Three men’s eyes could not meet all at the same time. 
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How see two other pairs at once? 

They would have to be crosseyed. 

Their eyes could not meet. 
ii. "Would magnify himself too. 

Horse cannot see himself to compare with his master. 

pintner-toops revised directions TEST. This test was useful for certain ab- 
normal cases as a quick index of reading ability. 9 Interpretation of the results 
was sometimes difficult, however, because for two items at least the subject could 
get enough cues from the non-verbal test situation to make the correct response. 
For normal subjects the test is not simply an index of reading ability. 


2. EDUCATIONAL ACHIEVEMENT TESTS 
a. Reading tests 

gray oral readinc psragraphs. Sets I through IV covered the necessary range 
of difficulty so well that the examiners used the particular set or sets best suited 
to the reading level of each subject. 10 This procedure made for efficient testing 
but the results were difficult to handle. As noted in Chapter IV, about 90 per 
cent of the group took Set IV, and the results for these subjects were presented, 
with the scores based on the total time for reading the 150 words of the test 
plus 1/50 of the total time for each error. 11 

THORNDIKE mccall readinc scale. This test was given and scored according 
to the author’s directions. 12 Form 1 was used for all the normal, subjects. No 
time limit was set, but with very few exceptions the subjects finished all they 
could do in twenty to thirty minutes. 

chapman unspeeded reading-comprehension test. This was selected espe- 
cially because the non-verbal response made it valuable in the aphasia study an^ 
it turned out to be well adapted in form and content to normal adult subjects. 
No time limit was set, but the subjects usually finished within thirty minutes. 


b. Spelling tests 

GATES ORAL spelling test. This test, composed of 36 w’ords, was scored in 
terms of the total number of words right on the first attempt or corrccte spon 
taneously by the subject 14 


* Published by C. H. Stoelririg. Chicago. , . 

Published by the Public School Publishing Company. Bloomington. Illinois. 

11 A number of these also took Set ItT, but standard errors were too ig 

„ from to „ tote, .6 ,8 for IV attend (to TO 9 •« for HI m- 


mated from IV. ... ■ ■ - 

»* Pub'ished by Teachers College Bureau of Publicauons, Columbia Uoirenity. A scoring 
sheet comes with the tests. - . , , . 

,s Published by Educational Test Bureau. 3416 Walnut Street, Phil p 
14 A. L Gates, The Improvement of Heading, 1917, 3®7 — 3®®' 
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Morrison mccall spellino scale. This test proved to be valuable for the ab- 
normal cases but on the whole too easy for the normal. 15 List 1 was used for the 
normal subjects. 

Stanford ACHIEVEMENT dictation test. The test used was the New Form 
W. 16 The sentences were begun at a level well below the subject’s ability, usually 
at the point where the sixth grade is supposed to begin, and continued until all 
three crucial words in two successive sentences had been failed. In the case of 
some few subjects this second condition was not obtained; for after repeated 
. failures, although not on six words in succession, they refused to continue the 
test. 

c. Writing a Letter 

In the course of the examination, each subject was asked to write a letter, to 
whomever he pleased and on whatever subject lie pleased. When all the letters 
had been collected, scale values for each were determined on the basis of the 
Nassau County Supplement to the Hillegas Scale. 17 Each scale value represented 
the value assigned by at least two of the three psychologists working inde- 
pendently, or in the comparatively rare cases in which three different values were 
assigned, the middle one of these. 

d. Arithmetic tests 

The Stanford Achievement Arithmetic Test was used in the old Form A; 
both sections, Computation and Reasoning, were given. 18 No time limit was set, 
but few subjects required more than fifteen to twenty minutes for each. 


3. NON-LANCUACE TESTS 

a. Tests from the Pintner Paterson Performance Scale 
Many of the ten tests of the Pintner Paterson Performance Scale which were 
used were poor in discriminating among the adults of the normal group, and 
only the best of them— the Mare and Foal, the Seguin, and the Substitution— 
have been reported. 19 All these were given and scored according to the Prntner- 
Paterson directions. 


b .Pintner Non-Language Mental Test 
The Pintner Non-Language Mental Test was given as an individual test, 

14 Morrison McCall Spelling Scale, Yonkers: World Boole . 

14 A sample set of the New Stanford Achievement Drctanon Test, cnatauun g the fores V, 

W. and X. may be obuioed from the World Book &mp»r. Yo^. 

»M. R. Trabue, Su n Umcnttn Z the ■„ I9,? ’ ,8 ’ 5I ~ Sa ' 

14 Tests and manual published by the World Book Company, Yonkers 
The old Form A was used because it bad been given to a number of the abnormal case, 
before the new form was published. 

16 R. Pintner and D. G. Paterson, A Scale o] 
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with the necessary demonstrations on paper rather than on a blackboard, but 
otherwise exactly according to Pintner's directions. 20 Pintncr’s weighting system 
was followed in obtaining the total score, but the data lor the various sub-tests 
in terms of raw score were also analyzed separately. 

c. Cancellation 

The Cancellation Test was one of a small group of relatively simple tests 
tried out with the abnormal cases, and the only one used with the normal sub- 
jects. 21 The score was taken as the number of a's correctly crossed in one and a 
half minutes. 


d. Immediate Recall: Kuhlmann Binet 

The Test of Immediate Recall of Unfamiliar Forms was given and scored 
according to Kuhlmann’s directions, although for purposes of comparison with 
the abnormal cases time and error scores were also calculated separately. 22 

e. Drawing a Chair 

The test of Drawing a Chair was arranged for the abnormal cases, to study 
the ability to draw from a model. The chair was set at such an angle that the 
subject could sec the left rear leg between the right front and rear legs. If he 
started to draw the chair as seen straight from the front, he was allowed to con- 
tinue without comment. The type of chair always used was an upright chair 
without arms. 

The scoring system devised was modeled after that Goodenough arranged 
for the drawing of a man in that points were given for the essenu'al components 
of the drawing and for the skill and accuracy with which these were repre- 
sented. 23 


Scoring for Drawing of a Chair 
ra. Top line for back of chair 

b. Upright on one side of back 

c. Upright on the other side 

d. Middle slat, bar, or panel 
aa. Seat of chair 

b. Attachment of seat to back 
3a. Left rear leg 


*° Plainer, A Non-Language Group Intelligence Test, J. AppI Psychol , >9 >9' 3. >99~2*4- 
Test manual and booklets published by College Book Company. Columbus. Ohio. 

21 Published by C. H. Stoeltine Company, Chicago; Test No 

See A. Bronner, W. Hedy, el at., A Manual of Individual Mental Tests and Testing. 1927. 
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b. Left front leg 

c. Right front leg 
<L Right rear leg 

e. Correct position and angle of left rear leg 

f . Corrert position and angle of left front leg 

g. Correa position and angle of right front leg 

h. Correa position and angle of right rear leg 

i. Stretchers 

j- Correct number of stretchers 

It. Correa position of stretchers 
4 a. Proportion : back narrower tlian high 

b. Proportion: back longer than front legs 

c. Proportion: seat area less than that of back 

d. Proportion: legs roughly equivalent in length 
5a. Lines “reasonably firm and mostly meeting” 

b. Lines firm and meeting 
6a. Pcrspcaise indicated by shape of seat 

b. Angle of attachment of seat and back, indicating that scat and back are 
approximately at right angles 

c. L*gs showing no transparency, and correct superposition, i.e., of seat, 
legs, and stretchers 

Each drawing was scored independently by two of the three psychologists; 
moot points were then discussed with the third psychologist and a final decision 
reached. 

The 35 drawings were ranked by each of the three psychologists. The rank- 
difference correlation between the ascrage rank so daermined and the rank in 
order of score assigned was + .83 ±.03. 

f Goodenough Drawing 

The Goodenough Drawing Test was scored according to Goodenough’s di- 
rections. 54 As for the drawing of a chair, each production was scored by two of 
the three psychologists, working independently; moot points were discussed with 
the third and a final decision reached. The 66 drawings were then rated, first by 
the three psychologists, who ranked them in order of their quality as repre- 
sentations of a man, and then by three artists. The rank-difference correlation 
between the two rankings was surprisingly high, +-97- The correlation between 
the ranking assigned by the three artists and the order m terms of the Goode- 
nough Score was also high, -f--9 l * 

g. Porteus Maze Test 
The Porous Msze Tes! svss “<• “° I ' d 
34 ibid , 85-153. 
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